Uses of a thyx polypeptide or a nucleic acid encoding such a polypeptide, in particular for screening anti-bacterial or anti-viral compounds

ABSTRACT

The invention is relative to the various uses of a novel enzyme family, referred to as THYX, capable of catalyzing the synthesis of thymidine 5′-monophosphate in the absence of an active thymidylate synthase enzyme (ThyA).  
     The invention also relates to reaction media for the thymidylate synthase activity of a THYX polypeptide as well as screening methods implementing said reaction media, as well as kits for implementing such methods.

FIELD OF THE INVENTION

[0001] The present invention relates to the field of enzymes involved in the DNA synthesis, and more specifically to the synthesis of an intermediary compound, thymidine 5′-monophosphate (dTMP), required for producing thymidine 5′-triphosphate (dTTP) constituent for the DNA molecule.

[0002] It is also relative to various uses of a new enzyme family, referred to as THYX, capable of catalyzing the synthesis of thymidine 5′-monophosphate in the absence of an active thymidylate synthase (ThyA) enzyme.

[0003] The invention also relates to reaction media for the thymidylate synthase activity of a THYX polypeptide as well as screening methods implementing said reaction media, as well as kits for implementing such methods.

PRIOR ART

[0004] Deoxythymidylate, unlike other deoxynucleotides, is not directly produced by a ribonucleotide reductase. Thymidine 5′-monophosphate (dTMP) is disclosed in the state of the art as being the final product of methylation of uridin 5′-monophosphate (dUMP), said methylation being catalyzed by a thymidylate synthase of the THYA type, as well in eukaryotes as in bacteria (Carreras and Santi, 1995). The thymidylate synthase THYA, which is also expressed in mammals, has been contemplated as a potential target of DNA synthesis inhibiting compounds, being useful more particularly for treating colorectal cancer (Papamichael, 2000).

[0005] A simplified illustration of the dTMP production within the cell through dUMP methylation is shown in Scheme 1 hereinunder.

[0006] In Scheme 1, there is illustrated the synthesis of dTMP in bacteria and all the eukaryotes through reducing methylation of dUPM under the action of thymidylate synthase THYA.

[0007] Another component, essential for dTMP synthesis, is the dihydrofolate reductase enzyme (DHFR), the catalytic activity of which is necessary for recycling dihydrofolate through the formation of an active cofactor, the CH₂H₄ folate, acting as a methyl donor. The thymidylate synthase could be inactivated by a fluoro-dUMP and the DHFR can be inactivated more particularly by amethopterin and aminopterin and trimethoprim, being folate derivates (see Scheme 1).

[0008] Many research works done on the synthesis route of the pyrimidine compounds all showed that the intracellular synthesis of dTMP could only be catalyzed by a single enzyme, the thymidylate synthase coded by the thyA gene (see Annual. Reviews on Biochemistry, 1995, vol. 64:721-762; Nature Reviews Mol. Cell. Biol., 2001, vol. 2, 147-151).

[0009] Moreover, studies performed on thymidylate synthase inhibitors for treating advanced colorectal cancer point out that <<a significant feature of the nucleotide metabolism is the duplication of the metabolic routes; the inhibition of any enzyme can be bypassed through one or more alternative routes. However, a noticeable exception to this rule is the thyA thymidylate synthase, which is an unavoidable enzyme representing the only way to add a methyl group in position 5 of the pyrimidine cycle in the de novo synthesis of thymidine (Papamichael, 1999).

SUMMARY OF THE INVENTION

[0010] Surprisingly, it has been shown according to the invention that a polypeptide family, referred to as THYX, having a structure completely distinct from the thyA gene coded polypeptides, are able to synthesize dTMP within the cells.

[0011] In particular, it has been shown according to the invention that cells with a gene coding a THYX polypeptide have a thymidylate synthase activity, in the absence of thyA gene. It has also been shown that introducing a copy of a thyX gene, coding a THYX polypeptide within an auxotrophic cell for the thymidine could enable to restore, within such a cell, the capacity to de novo synthesize dTMP and finally the thymidylate.

[0012] It has also been shown according to the invention that the thyX genes can also be found in the genome of numerous bacteria, bacteriophages and bacterial viruses or eukaryotes, whereas they are absent from the mammalians' genome, more particularly from the human genome.

[0013] The characterization according to the invention of a new synthesis route of dTMP by means of the THYX protein family has made available, for the first time, to those skilled in the art the numerous applications directly derived therefrom and which are set forth herein. The various uses of a THYX polypeptide or a nucleic acid coding a THYX polypeptide set forth herein are technically linked by virtue of the common functional and structural features of the THYX polypeptides as hereinunder defined.

[0014] An object of the invention is the use of a THYX polypeptide comprising the following amino acid sequence:

[0015] X₁HR(X)₇S, wherein:

[0016] X₁ represents the amino acid R (Arginine or Arg) or K (Lysine or Lys), and

[0017] (X)₇ is a chain with seven consecutive amino acids wherein each X represents, independently from each other, any one of the 20 naturally occurring amino acids,

[0018] in an in vitro synthesis method for the thymidine 5′-monophosphate (dTMP),

[0019] S represents the Serine amino acid or Ser according to the one letter code in accordance with the international nomenclature.

[0020] Preferably, X₁ represents the amino acid R.

[0021] Preferably, the THYX polypeptide is selected amongst polypeptides comprising the amino acid sequences SEQ ID N^(o)1 to SEQ ID N^(o)37.

[0022] The invention is also relative to using a nucleic acid coding a THYX polypeptide such as defined hereinabove for producing said THYX polypeptide.

[0023] Preferably, the nucleic acid is selected amongst nucleic acids comprising the nucleotidic sequences SEQ ID N^(o)44 to SEQ ID N^(o)64.

[0024] The invention also relates to using a THYX polypeptide such as defined hereinabove in a method for screening thymidylate synthase inhibiting compounds, more particularly anti-bacterial or anti-viral compounds.

[0025] It is also relative to using a nucleic acid coding a THYX polypeptide such as defined hereinabove in a method for screening thymidylate synthase inhibiting compounds, including anti-bacterial or anti-viral compounds.

[0026] Another object of the invention is also the use of an antisense oligonucleotide specifically hybridizing with the messenger RNA coding a THYX polypeptide in order to in vitro inhibit the DNA synthesis within a bacterium or a virus.

[0027] It is also relative to using a nucleic acid coding a THYX polypeptide such as defined hereinabove as a selection marker for a genetic recombination event.

[0028] It also relates to using a specific probe or nucleotidic primer for a nucleic acid coding a THYX polypeptide in order to detect a bacterium or a virus, more particularly a pathogenic bacterium or virus for mammals, and more specifically for man, as well as a set or kit for detecting such a bacterium or such a virus, comprising a specific probe or nucleotidic primer for a nucleic acid coding a THYX polypeptide.

[0029] Additionally, the Applicant's works relating to the ThyX activity have made it possible to identify the enzyme mechanism and thereby optimize the reaction conditions enhancing the importance of adding some compounds within the reaction medium.

[0030] The aim of the invention is therefore the use of such media and the application thereof.

DESCRIPTION OF THE FIGURES

[0031]FIG. 1 illustrates an alignment of amino acid sequences of various THYX polypeptides.

[0032] In the left column is listed the name of the organisms from which each of the amino acid sequences is derived.

[0033] The numbers identify the order number of the amino acid at the beginning of the corresponding lineage, within the THYX polypeptide sequence of the organism.

[0034]FIG. 2A illustrates immuno-imprint gels made on cellular extracts obtained from E. coli X2913 (ΔthyA) bacteria transfected with the ThyX gene of P. abyssi (lanes number 1 and number 2) or E. coli X2913 (ΔthyA) bacteria transfected with the thyX gene of H. pylon (lanes number 3 and number 4).

[0035] The transforming E.coli bacteria were cultivated in the absence of Arabinose (lanes number 1 and 3) or in the presence of 0.2% Arabinose (lanes number 2 and 4). The visible bands on the immuno-imprint gels (3 Western blot>>) respectively correspond to the THYX polypeptide of P. abyssi (lane number 2) and to the THYX polypeptide of H. pylon (lane number 4).

[0036]FIG. 2B illustrates photographs of Petri dishes being seeded:

[0037] on the left part, by an E. coli Chi2193 (ΔthyA) bacterium transformed with the thyX gene of H. pylori; and

[0038] on the right part, with E. coli Chi2193 (ΔthyA) bacteria transformed with the thyx gene of P. abyssi.

[0039] The left photograph shows the growth results for the transformed E. coli bacterium cultivated on gelose (agar) in a M9 minimum medium added with 0.2% Arabinose. Only the E. coli bacteria transformed with the thyX gene of H. pylori multiply.

[0040] The right photograph shows the same bacteria cultivated in gelose in the presence of a M9 minimum medium in the absence of Arabinose. No bacterial growth is observed, whichever the transfected bacterium is.

[0041]FIG. 3 shows a migration on SDS PAGE gel colored with Comassie Blue. The first well corresponds to the PROMEGA Mid-Range marker. The arrows show the molecular weights corresponding to the marker bands. The second well contains the sample of our protein; after being purified on a Ni-NTA column, its band is at about 28 kDa.

[0042]FIG. 4 is a Western Blot of the fractions after purification of the protein on a Ni-NTA column, which shows THYX purification.

[0043] M=PROMEGA Mid-Range marker,

[0044] TF=the first fraction after incubation of the protein with the resin,

[0045] L=fraction after washing, and

[0046] E1=eluted protein.

[0047]FIG. 5 shows biochemical analyses of the ThyX of H. Pylori.

[0048]FIG. 6 illustrates the formation activity for dTMP of the ThyX protein of H. pylori.

DETAILED DESCRIPTION OF THE INVENTION

[0049] It has been shown for the first time according to the invention that a polypeptide family, referred to as THYX, distinct from the polypeptide family coded by the thyA genes, has a catalytic activity of the thymidylate synthase type.

[0050] The Applicant cloned the thyX gene of H. Pylori in a functional expression vector in Escherichia coli. Such an expression vector has been used for transforming an auxotrophic Escherichia coli strain for thymidine, more specifically the E. coli X2913 (ΔthyA572) strain, the genetic material of which is precisely characterized and wherein the thyA gene is deleted. The results being set forth in the examples show that the expression of the thyx gene which has been artificially introduced into the X2913 (ΔthyA) strain has made it possible to restore the ability of E coli to de novo synthesize dTMP. Because of the large knowledge gathered on the features of the E. coli genome as well as on the normal synthesis route of dTMP via the expression of the THYA thymidylate synthase, the so-obtained results show that the expression product for the thyx gene of H. Pylori directly overcomes the production deficiency of the THYA polypeptide in such a bacterial organism.

[0051] Through a sequence homology study, it is shown according to the invention that the polypeptides having sequences homologous to the THYX polypeptides of Helicobacter pylori (SEQ ID n^(o)21) were also coded by the genome of numerous bacteria, Archaebacteries bacteriophages as well as in some viruses.

[0052] Surprisingly, it is also shown according to the invention that all the bacteria, Archae bacterie and viruses having in their genome a copy of a thyX gene do not have simultaneously the thyA gene, previously known as the only gene able to perform the dTMP synthesis. Such organisms having the thyx gene in the absence of the thyA gene do not include either any copy of the tdk gene coding the thymidine kinase required for an intracellular incorporation of exogenous thymidine which is subsequently converted within the cell into dTMP, with the noticeable exception of the Mycobacterium tuberculosis bacterium. The simultaneous presence of both the thyA and thyX genes in M. tuberculosis is probably due to a gene transfer event.

[0053] The mutual exclusion of the thyA and thyx genes in the genome of the above-mentioned organisms clearly shows that thyX compensates for the thymidylate synthase function that is no longer ensured in the absence of thyA. Additionally, only the thyA gene is systematically absent from the genome of bacteria having the thyxgene, while other genes involved in the nucleotide metabolism are undiscriminately present or absent from the genome of such organisms.

[0054] It is also shown according to the invention that thyA deficient bacteria wherein the thyX gene is present are mostly pathogenic bacteria for mammals. Are more specially to be mentioned Campylobacterjejuni causing poisonings food intoxications, Helicobacter pylori being a causal agent for ulcers, Rickettsia prowazekii being a causal agent for typhus, Borrelia burgdorferi being involved in Lyme's disease, Treponema pallidum being the causal agent for syphilis, bacteria of the Chlamydiae genus which are compulsory intracellular pathogens as well as eukaryotic DNA viruses such as Chorella virus.

[0055] It has also been shown according to the invention that bacteria wherein the thyX gene has been inactivated become auxotrophic for thymidine, i.e. they only multiply if exogenous thymidine is added to the culture medium.

[0056] All the proteins belonging to the THYX family having a newly identified thymidylate synthase activity according to the invention share in common the structural and functional features as indicated hereinunder.

[0057] Structural Features

[0058] a) The thyx genes code, all without exception, a THYX polypeptide comprising the following amino acid sequence:

[0059] X₁H R(X)₇ S, wherein:

[0060] X₁ represents the amino acid R (Arginine or Arg) or K (Lysine or Lys), and

[0061] (X)₇ is a chain with seven consecutive amino acids wherein each X represents, independently from each other, any of the 20 naturally occurring amino acids.

[0062] Preferably, X₁ represents the amino acid R.

[0063] X₁ represents K, particularly for the THYX polypeptide coded by the Roseophage S101 genome.

[0064] b) As clearly shown by the alignment of the amino acid sequences of the THYX proteins from various origins illustrated in FIG. 1, the THYX polypeptides have a preserved serine amino acid residue. Moreover, the results from the mutagenesis experiments made on the thyX gene of Helicobacter pylori, as set forth in the examples, show that the preserved serine amino acid is indispensable for the catalytic activity of the THYX polypeptide, since substituting respectively a cysteine or alanine residue for the serine residue leads to the production of a polypeptide being unable to compensate for the THYA polypeptide deficiency in the E. coli X strain of 2913 (ΔthyA).

[0065] c) The THYX polypeptides do not have any cysteine amino acid residues being preserved in their sequences, unlike the polypeptides coded by the thyA genes where the preserved cysteine residue has a nucleophilic essential part in the methylation reaction catalyzed by the THYA polypeptides.

[0066] d) Using the BLAST software (version 2.0) with the default parameters and then the Psi-BLAST module in order to perform iterative cycles of homology research with the THYX polypeptide sequence of H. Pylori as <<lure>>, no target sequence of thyA thymidylate synthase has been selected, showing the lack of homology between thya and thyX genes.

[0067] Functional Features

[0068] a) A THYX polypeptide according to the invention is able to catalyze the oxidation of methylene tetrahydrofolate into tetrahydrofolate, which is a feature of the catalytic activity of the thymidylate synthase type, and shows that the THYX polypeptides belong to the enzyme class of the thymidylate synthase type;

[0069] b) the growth of microoganisms transformed by a thyX gene could be inhibited by a high trimethoprim concentration, being an inhibitor specific for the dihydrofolate reductase; the fact that only high trimethoprim concentrations inhibit THYX suggests that there are functional differences between the metabolic formation routes of thymidylate where are respectively involved thyA and thyX;

[0070] c) bacteria wherein the thyX gene has been inactivated become auxotrophic for thymidine;

[0071] d) a THYX polypeptide according to the invention is only catalytically active in the presence of a co-factor of flavin type. It is to be noted that the polypeptides coded by the thyA genes, previously known as being the only genes coding enzymes of the thymidylate synthase type, are active without requiring the presence of flavin.

[0072] Are included in the polypeptides belonging to the THYX polypeptide family having structural and functional features as defined hereinabove the THYX polypeptides comprising the amino acid sequences SEQ ID N^(o)1 to SEQ ID N^(o)37.

[0073] The THYX polypeptides of the sequences SEQ ID n^(o)1 to SEQ ID n^(o)37 were made available to the public specially through their publication in data bases of amino acid sequences.

[0074] An object of the invention is the use of a THYX polypeptide comprising the following amino acid sequence:

[0075] X₁ HR(X)₇ S, wherein:

[0076] X₁ represents the amino acid R (Arginine or Arg) or K (Lysine or Lys), and

[0077] (X)₇ is a chain with seven consecutive amino acids wherein each X represents, independently from each other, any of the 20 naturally occurring amino acids,

[0078] in an in vitro synthesis method for the thymidine 5′-monophosphate (dTMP).

[0079] Preferably, the amino acid X₁, represents the amino acid R.

[0080] In order to achieve the in vitro synthesis of dTMP by means of a THYX polypeptide according to the invention, those skilled in the art could more particularly refer to the examples herein, wherein the thymidylate synthase activity of the THYX polypeptide of Helicobacter pylori (SEQ ID N^(o)21) is shown in cellular extracts through the detection of the oxidation reaction of the methylene tetrahydrofolate compound.

[0081] Preferably, the above-mentioned use is characterized in that the THYX polypeptide is selected amongst polypeptides comprising the amino acid sequences SEQ ID N^(o)1 to SEQ ID N^(o)37.

[0082] According to another aspect, the above-mentioned use is characterized in that the THYX polypeptide is selected amongst polypeptides comprising the amino acid sequences SEQ ID N^(o)1 to SEQ ID N^(o)37.

[0083] The THYX polypeptide with amino acid sequences SEQ ID N^(o)5 is coded by a gene derived from the Dictyostelium discodideum organism which was disclosed in 1989 by Dynes and Firtel, and referred to as <<Thy1>> by these authors.

[0084] However, DYNES and FIRTEL explicitly excluded that the Thy1 gene of Dictyostelium discoideum, could code a thymidylate synthase. The biosynthesis route for thymidine by Dictyostelium discoideum, which is still not known heretofore, was obviously unknown in 1989, as well as were also unknown the molecular bases causal for the thymidine autotrophy. In addition, the Dictyostelium discoideum organism has not yet been the subject of systematic sequencing studies of its genome. A fortiori, in 1989, no data was available regarding the characterization of the genetic material of such organism, which is still an insurmountable technical barrier for identifying the direct functional part of a mutation, more particularly a mutation leading to an alteration of a metabolic route as complex as that of nucleotides, in particular the thymidine nucleotide. In fact, DYNES and FIRTEL did not characterize the nature of the mutation in Dictyostelium discoideum. The function of the DNA insert of the clone allowing for complementing the organism so as to restore the autotrophy through thymidine was totally unknown. Additionally, since 1989, the numerous research teams for studying the cellular biosynthesis route of dTMP continued to gather experimental results showing that the thymidilate synthase coded by the thyA genes would be the only synthesis route for dTMP (above-mentioned D. PAPAMICHAEL The Oncologist, 1989).

[0085] The invention is also relative to using a nucleic acid coding a THYX polypeptide such as defined hereinabove with a view to producing said THYX polypeptide for implementing it in the various uses of a THYX polypeptide as disclosed herein.

[0086] Starting from amino acid sequences SEQ ID N^(o)1 to SEQ ID N^(o)37 and/or nucleotidic sequences SEQ ID N^(o)44 to SEQ ID N^(o)64, those skilled in the art are able to detect, isolate, clone and characterize any nucleic acid coding a THYX polypeptide such as defined hereinabove, for example synthetizing nucleotidic probes specific for a nucleic acid coding the X₁ HR(X)₇ S or RHR(X)₇ S peptide.

[0087] For making such probes, those skilled is the art could adapt their sequence depending on the use of the codon for a given organism. The detection of a thyX gene could be achieved through hybridation on a DNA gel (<<Southern Blot>>) as well as through PCR amplification, for example using the above defined probe as a nucleotidic primer.

[0088] For example, the thyX gene of H. Pylori could be isolated by means of the nucleotidic primers with sequences SEQ ID N^(o)38 and SEQ ID N^(o)39 and the thyx gene of P. abyssi could be isolated by means of the nucleotidic primers with sequences SEQ ID N^(o)40 and SEQ ID N^(o)41. The thyX gene of Campylobacter jejuni could be isolated by means of the nucleotidic primers with sequences SEQ ID N^(o)42 and SEQ ID N^(o)43.

[0089] Preferably, the nucleic acid coding a THYX polypeptide is selected amongst the nucleic acids coding a THYX polypeptide comprising one of the amino acid sequences SEQ ID N^(o)1 to SEQ ID N^(o)37.

[0090] According to another aspect, the nucleic acid is selected amongst the nucleic acids coding a THYX polypeptide consisting in one of the amino acid sequences SEQ ID N^(o)1 to SEQ ID N^(o)37.

[0091] Advantageously, the nucleic acid is selected amongst the nucleic acids comprising the nucleotidic sequences SEQ ID N^(o)44 to SEQ ID N^(o)64.

[0092] Amongst the THYX polypeptides able to be implemented according to the invention are included the THYX polypeptides having at least 95% amino acid identity with a THYX polypeptide selected amongst sequences SEQ ID N^(o)1 to SEQ ID N^(o)37.

[0093] Amongst the nucleic acids coding a THYX polypeptide able to be implemented according to the invention is included a nucleic acid having at least 95% nucleotide identity with a nucleic acid selected amongst nucleotidic sequences SEQ ID N^(o)44 to SEQ ID N^(o)64.

[0094] Identity between two Nucleic Acids or between two Polypeptides

[0095] For the purpose of the present specification, the expression <<nucleotidic sequence>> is used for undiscriminately referring to a polynucleotide or a nucleic acid. The expression <<nucleotidic sequence>> encompasses the genetic material itself and hence, is not restricted to the information regarding the sequence thereof.

[0096] According to the invention, a first nucleic acid having at least 95% identity with a second reference nucleic acid, would have at least 95%, preferably at least 96%, 97%, 98%, 98.5%, 99%, 99.1%, 99.2%, 99.3%, 99.4%, 99.5%, 99.6%, 99.7%, 99.8% or 99.9% nucleotide identity with said second reference polynucleotide, the identity percentage between two sequences being determined as described hereinunder.

[0097] According to the invention, a first polypeptide having at least 95% identity with a second reference polypeptide, would have at least 95%, preferably at least 96%, 97%, 98%, 98.5%, 99%, 99.1%, 99.2%, 99.3%, 99.4%, 99.5%, 99.6%, 99.7%, 99.8% or 99.9% amino acid identity with said second reference polypeptide, the identity percentage between two sequences being determined as described hereinunder.

[0098] The “identity percentage” between two nucleotide or amino acid sequences, as meant in the present invention, could be determined comparing two optimally aligned sequences, through a comparison window.

[0099] The part of the nucleotidic or polypeptidic sequence in the comparison window could hence comprise additions or deletions (for example “gaps”) as compared to the reference sequence (which does not comprise such additions or such deletions) so as to obtain an optimal alignment of the two sequences.

[0100] The percentage is calculated by determining the number of positions where an identical nucleic base or a amino acid residue is observed for the two sequences (nucleic or peptidic) to be compared, then dividing the number of positions where there is an identity between the two amino acid bases or residues to be compared, by the total number of positions in the comparison window, then multiplying the result by one hundred so as to obtain the sequence identity percentage.

[0101] The optimal alignment of sequences for the comparison could be achieved with a computer using known algorithms.

[0102] Preferably, the sequence identity percentage is determined using the BLAST software (version BLAST 2.06 dated September 1998), exclusively using the default parameters.

[0103] A nucleic acid having at least 95% nucleotide identity with a nucleic acid according to the invention encompasses “variants” of a nucleic acid according to the invention.

[0104] As used herein a nucleic acid “variant” according to the invention means a nucleic acid differing from the reference nucleic acid through one or more substitutions, additions or deletions of a nucleotide, compared to the reference nucleic acid. A variant of a nucleic acid according to the invention could be from natural origin, such as a naturally occurring allelic variant. Such a variant nucleic acid could also be a non-naturally occurring nucleic acid obtained, for example, using mutagenesis techniques.

[0105] Generally, the differences between the reference nucleic acid and the “variant” nucleic acid are reduced such that the reference nucleic acid and the variant nucleic acid have very similar nucleotidic sequences and in numerous regions identical. The nucleotidic modifications present in a variant nucleic acid could be silent, meaning that they do not affect the amino acid sequence which could be coded by such a variant nucleic acid.

[0106] The nucleotide modifications in such a variant nucleic acid could also result in substitutions, additions or deletions of one or more amino acids in the sequence of the polypeptide that could be coded by such a variant nucleic acid.

[0107] More preferably, a variant nucleic acid according to the invention comprising an open reading phase, codes a polypeptide maintaining the same biological function or the same biological activity as the polypeptide coded by the reference nucleic acid.

[0108] Most preferably, a variant nucleic acid according to the invention and comprising an open reading phase, codes a THYX polypeptide maintaining the catalytic activity of a thymidylate synthase, which could more particularly be detected by the ability of the THYX polypeptide to oxidize the methylene-tetradihydrofolate in vitro as well as to restore the ability of a Thya thymidylate synthase deficient bacterium or eukaryote cell to synthesize DNA, as described in the examples.

[0109] Much more preferably, a variant THYX polypeptide will not comprise any amino acid modification on the X₁HR(X)₇S pattern, and the preserved Serine amino acid is present.

[0110] In the THYX polypeptides able to be implemented according to the invention are included the THYX polypeptides coded by a nucleic acid such as defined hereinabove.

[0111] Are also included in the definition of a THYX polypeptide according to the invention the THYX polypeptides comprising one or more amino acid substitutions in one of the sequences SEQ ID N^(o)1 to SEQ ID N^(o)37 by an <<equivalent>> amino acid. Are included in the definition of “equivalent” amino acids, the amino acids belonging to the same class, such as the acidic (D and E), basic (K, R and H), non polar (A, V, L, l, P, M, F and W) as well as non charged polar (G, S, T, C, Y, N and Q) amino acids.

[0112] Are also within the scope of the invention the polypeptides referred to as “homologous” to any of the THYX polypeptides of the amino acid sequences SEQ ID N^(o)1 to SEQ ID N^(o)37, or the variants thereof.

[0113] Such homologous polypeptides have amino acid sequences with one or more substitutions of an amino acid by an equivalent amino acid, as compared to the reference polypeptides.

[0114] It is meant by equivalent amino acid according to the present invention, for example the substitution of a residue in the D form for a residue in the L form as well as the substitution of a pyro-glutamic acid for a glutamic acid (E) according to techniques well known to those skilled in the art. By way of illustration, the synthesis of a peptide containing at least one residue in the D form is disclosed by KOCH (1977).

[0115] According to another aspect, are also considered as being equivalent amino acids two amino acids belonging to the same class, i.e. two acidic, basic, non polar as well as non charged polar amino acids.

[0116] Preferably, the polypeptides according to the invention comprising one or more additions, deletions, substitutions of at least one amino acid maintain their ability to be recognized by antibodies raised against the unmodified polypeptides. Such polypeptides also maintain their thymidylate synthase catalytic activity.

[0117] Preferably, a THYX polypeptide or a nucleic acid coding a THYX polypeptide able to be implemented according to the invention is in isolated or purified form.

[0118] The term “isolated” as used herein means a biological material which has been removed from its original environment (the environment where it is naturally occurring). For example, a polypeptide or a polynucleotide naturally occurring in an animal or a plant is not isolated. The same polypeptide separated from its natural environment or the same polynucleotide separated from the adjacent nucleic acids wherein it is naturally inserted in the genome of the animal or of the plant is isolated.

[0119] Such a polynucleotide could be included in a vector and/or such a polynucleotide could be included in a composition and could yet remain in the isolated state, because the vector or the composition is not its natural environment.

[0120] The term “purified” does not require that the material should be present in absolute purity form, excluding the presence of other compounds. It is rather a relative definition. A polypeptide or a polynucleotide is in a purified state after purification of the starting material of at least one order of magnitude, preferably 2 or 3 and more preferably 4 or 5 orders of magnitudes.

[0121] Are also within the scope of the invention, THYX polypeptides with sequences SEQ ID N^(o)1 to SEQ ID N^(o) 37 made proteolysis resistant through the introduction of one or more non peptidic links, such as a reduced link (CH₂NH), a retro-inverso link (NHCO), a methylene-oxy link (CH₂—O), a thiomethylene link (CH₂—S), a carba link (CH₂—CH₂), a ketomethylene link (CO—CH₂), a hydroxyethylene link (CHOH—CH₂) as well as a CH═CH link.

[0122] In all cases, a THYX polypeptide able to be implemented according to the invention maintains the catalytic activity of a thymidylate synthase, which could be more particularly detected by the ability of the THYX polypeptide to oxidize the methylene-tetradihydrofolate in vitro as well as to restore the ability of a THYA thymidylate synthase deficient bacterium or eukaryotic cell to synthesize the DNA, as disclosed in the examples.

[0123] Production of a THYX Polypeptide able to be Implemented According to the Invention

[0124] The invention is also relative to a method for producing one of the THYX polypeptides as defined hereinabove, in particular a polypeptide selected amongst the THYX polypeptides with amino acid sequences SEQ ID N^(o)1 to SEQ ID N^(o)37 or of a variant thereof, said method comprising the steps of:

[0125] a) inserting a nucleic acid coding said polypeptide into an appropriate vector;

[0126] b) cultivating, in an appropriate culture medium, a host cell previously transformed or transfected with the recombinant vector from step a);

[0127] c) recovering the conditioned culture medium or lysing the host cell, for example through sonication or osmotic shock;

[0128] d) separating and purifying said polypeptide from said culture medium or also from cell lysates obtained in step c);

[0129] e) if need be, characterizing the produced recombinant polypeptide.

[0130] The THYX polypeptides according to the invention could be characterized through an attachment on an immunoaffinity chromatography column where the antibodies raised against such a polypeptide or against a fragment or a variant thereof have been previously immobilized.

[0131] According to another aspect, a recombinant THYX polypeptide according to the invention could be purified through passing on an appropriate plurality of chromatography columns, using methods known to those skilled in the art.

[0132] A THYX polypeptide according to the invention could also be prepared using the traditional chemical synthesis techniques, either in a homogenous solution or in a solid phase.

[0133] By way of illustration, a THYX polypeptide according to the invention could be prepared using the technique or in a homogenous solution as disclosed by HOUBEN WEYL (1974) or also the solid phase synthesis technique as disclosed by MERRIFIELD (1965a; 1965b).

[0134] Methods for Screening Thymidylate Synthase Inhibiting Compounds

[0135] As already previously set forth, the thyX genes are found in numerous pathogenic bacteria for mammals, in particular being pathogenic for man, and in some viruses. Additionally, unlike the thyA genes, the thyx genes are not to be found in the mammals' genome. More particularly, the human genome is thyx gene free.

[0136] As a result, THYX polypeptides and nucleic acids coding THYX polypeptides are preferred targets for compounds specifically inhibiting the expression of thyx genes or specifically inhibiting the thymidylate synthase activity of THYX polypeptides. Such inhibiting compounds are able to inhibit the DNA synthesis in numerous bacteria, bacteriophages and viruses, more particularly those being pathogenic for mammals, including for man, as well as the multiplication of such bacteria, bacteriophages and viruses, while not causing unwanted effects in such mammals, or at least only causing very little unwanted effects in mammals, including man, whose genome does not comprise any copy of a thyx gene.

[0137] Method for Screening THYX Polypeptide Inhibiting Compounds

[0138] Another object of the invention is also the use of a THYX polypeptide such as defined in the description in a method for screening anti-bacterial or anti-viral compounds.

[0139] According to a first embodiment of a method for screening an anti-bacterial or anti-viral compound according to the invention a THYX polypeptide such as defined in the description could be used for screening molecules being attached thereon.

[0140] The attachment of the polypeptide to the molecule or substance can or not inhibit (antagonist molecule) the thymidylate synthase activity of said polypeptide.

[0141] Such molecules able to attach themselves to any of the polypeptides according to the invention comprise antibodies, oligonucleotides, other proteins and generally speaking, small molecules of any nature.

[0142] In such a screening test, the attachment of the candidate molecule to the polypeptides could simply be shown, one of both partners being labeled with a detectable compound (polypeptide of interest or candidate molecule), wherein the THYX polypeptide/candidate molecule complex is then visualized through the detection of the detectable marker, after removal of the non specifically linked candidate molecules.

[0143] By way of example, a screening test of a candidate molecule able to attach itself to a polypeptide according to the invention could advantageously comprise a first step where the polypeptide of interest or the candidate molecule is immobilized on a substrate, a second step where the second partner (candidate molecule or polypeptide of interest) is brought in the presence of the first compound previously immobilized on the substrate, a third step where one or more cleaning operations are performed in conditions appropriate for removing compounds being not specifically linked, and finally a fourth step where the complex optionally formed between the polypeptide of interest and the candidate molecule is detected.

[0144] In the embodiment of the screening test where the candidate molecule is previously immobilized on a substrate and subsequently brought in the presence of the polypeptide of interest according to the invention, the detection of the complex formed by the candidate molecule and the polypeptide of interest according to the invention could be advantageously preformed using an antibody such as described hereinabove.

[0145] In another embodiment of the screening test where the polypeptide of interest according to the invention is previously immobilized on a substrate, the candidate molecule will be advantageously labeled using a detectable marker prior to its contact with the immobilized polypeptide of interest.

[0146] Such a detectable marker could be radioactive or non radioactive, for example, fluorescent or could correspond to a ligand for a third partner used for detection like a biotin molecule.

[0147] Consequently, another object of the invention is also a method for screening a candidate molecule or substance interacting with a polypeptide according to the invention, said method comprising the steps of:

[0148] a) contacting a polypeptide in accordance with the invention with the candidate substance or molecule to be tested;

[0149] b) detecting the complexes optionally formed between said polypeptide and said candidate substance or molecule.

[0150] The invention is also relative to a set or a kit for screening a candidate molecule or substance interacting with a polypeptide according to the invention, said set comprising:

[0151] a) a polypeptide in accordance with the invention;

[0152] b) if need be, means required for detecting the complex being formed between said polypeptide and the candidate molecule or substance.

[0153] Method for Screening Anti-bacterial or Anti-viral Compounds in an Acellular System

[0154] According to such other method for screening anti-bacterial or anti-viral compounds according to the invention, the test for inhibiting the thymidylate synthase activity of a THYX polypeptide is carried out in a cellular system, for example in a cell culture lysate expressing the THYX polypeptide and which does not simultaneously express any polypeptide coded by a thyA gene.

[0155] The cells from which the cell lysate is obtained are preferably cells having the same features as those implemented in the method for screening in an acellular system being described more in detail earlier in the specification.

[0156] In short, the cells from which the cell lysate is obtained are respectively:

[0157] either cells naturally expressing THYX in the absence of THYA;

[0158] or cells in the genome of which the thyA gene has been inactivated and which are transfected with a recombinant vector expressing a thyx gene.

[0159] The cell lysate could be obtained from a culture of the cells as described hereinabove, for example, through sonication or by osmotic shock, according to techniques well known to those skilled in the art.

[0160] After the cell lysis step as such, the cell fragments could be removed in a centrifugation step at the end of which such cell fragments are to be found in the pellet, the centrifugation supernatant comprising, amongst others, all the proteins including the THYX protein, and wich is recovered for implementing the screening method.

[0161] According to the screening method in an acellular system, the thymidylate synthase activity is quantified respectively in control samples only containing the cell lysate and in test samples containing a candidate inhibiting compound, if need be, for a plurality of increasing concentrations of the candidate inhibiting compound.

[0162] Preferably, a plurality of test samples will be implemented, comprising a given candidate inhibiting compound, at increasing concentrations.

[0163] The thymidylate synthase activity could be quantified, more particularly, through detecting the oxidation of methylene-tetrahydrofolate, as described in the examples.

[0164] Still another object of the invention is a method for screening an anti-bacterial or anti-viral compound in vitro in an acellular system, characterized in that said method comprises the steps of:

[0165] a) preparing a cell lysate from a culture of cells expressing a THYX polypeptide in the absence of a polypeptide coded by a thyA gene;

[0166] b) adding to the cell lysate obtained in step a) the inhibiting compound to be tested;

[0167] c) comparing the thymidylate synthase activity respectively in the cell lysate as obtained in step a) and in the cell lysate as obtained in step b); and

[0168] d) selecting the candidate compounds for which some inhibition of the thymidylate synthase activity has been detected.

[0169] A further object of the invention is also a kit or a set for screening a thymidylate synthase inhibiting compound characterized in that it comprises:

[0170] a) a composition comprising a THYX polypeptide in solution or in lyophilized form;

[0171] b) optionally one or more reagents required for quantifying the thymidylate synthase activity.

[0172] According to a first embodiment, the composition containing the THYX polypeptide comprises a cell lysate prepared as described hereinabove.

[0173] According to a second embodiment, the composition comprising the THYX polypeptide comprises an amount of the THYX polypeptide in purified form adapted for the obtention of an assay sample in solution comprising a concentration of the THYX polypeptide ranging from 10⁻¹⁰ to 10⁻²M, preferably from 10⁻⁸ to 10⁻³M and most preferably, from 10⁻⁷M to 10⁻⁵M.

[0174] Method for Screening Anti-bacterial or Anti-viral Compounds in a Cell System

[0175] The invention also relates to using a nucleic acid coding a THYX polypeptide such as defined in the description in a method for screening anti-bacterial or anti-viral compounds.

[0176] Preferably the nucleic acid codes a THYX polypeptide selected amongst polypeptides comprising the amino acid sequences SEQ ID N^(o)1 to SEQ ID N^(o)37.

[0177] Most preferably, the nucleic acid is selected amongst the nucleic acids comprising the nucleotidic sequences SEQ ID N^(o)44 to SEQ ID N^(o)64.

[0178] The invention is also relative to a kit or a set for screening an anti-bacterial or anti-viral compound, characterized in that it comprises:

[0179] a) a recombinant expression vector comprising a nucleic acid coding a THYX polypeptide such as defined in the present description, under the control of a functional promoter in a host cell wherein its expression is being sought or in a host cell transfected with such a recombinant vector;

[0180] b) optionally one or more reagents required for quantifying the thymidylate synthase activity.

[0181] According to a second embodiment of a method for screening anti-bacterial or anti-viral compounds according to the invention, the activity of the thimydylate synthase activity inhibiting compounds of THYX could be tested in cell cultures expressing THYX, in the absence of thyA expression.

[0182] According to a first aspect, such a screening method could be implemented on cell cultures for the genome having a copy of the THYX gene, but no thyA gene, as for example Campylobacter jejune, Helicobacter pylori, Rickettsia prowazekii, Borrelia burgdorferi or Chlamydia cultures.

[0183] According to a second aspect, such a method for screening anti-bacterial or anti-viral compounds in a cell system could be implemented using cell cultures wherein the thyA gene has been inactivated and which have been transfected by a nucleic acid or a recombinant vector expressing a thyX gene in such cells, as for example the E.coli n^(o)X 2913 (ΔthyA) strain which has been transfected with an expression vector coding a THYX polypeptide.

[0184] Still another object of the invention is a method for screening an anti-bacterial or anti-viral compound characterized in that it comprises the steps of:

[0185] a) cultivating cells expressing the thyX gene in the absence of expression of the thyA gene in an appropriate culture medium;

[0186] b) contacting the cells with a candidate compound to be tested; and

[0187] c) selecting the candidate compounds inhibiting the thimydylate synthase activity of the polypeptide coded by the thyX gene.

[0188] The compounds selected by means of the above-mentioned screening method are those for which some inhibition of the thymidylate synthase activity is observed in the cells, as opposed to the thymidylate synthase activity as observed in the control cell cultures which are not put in the presence of candidate compounds to be tested.

[0189] The quantification of the thymidylate synthase activity could be achieved for example by incorporating a radiolabelled uracil in the DNA of the cultured cells, using techniques well known to those skilled in the art, the amount of radiolabelled dTTP in the DNA reflecting the thymidylate synthase activity level in the cell culture. In such a case, the cells as cultivated in step a) of the method are incubated in the presence of a radiolabelled uracil, for example (³H)-uracil or (¹⁴C)-uracil.

[0190] The radioactivity of dTTP in the DNA is measured after hydrolysis of DNA purified using standard techniques (radioactivity counter).

[0191] After lysis of the cells, for example, through sonication or osmotic shock, the cell lysates are filtered on a nitrocellulose membrane retaining the DNA, and thereafter the radioactivity contained on the filter is measured using an adapted radioactivity counter.

[0192] In the embodiment of the above-mentioned screening method, where the cultivated cells consist in cells having their genome which does not comprise any active copy of a thyA gene and which have been transfected with a recombinant vector comprising a DNA insert coding a THYX polypeptide, the recombinant vector will be selected so as to allow for the expression of the THYX polypeptide in the host cell being cultivated during the method.

[0193] Examples of recombinant vectors useful for implementing the above-described screening method are detailed hereinafter.

[0194] Recombinant Vectors able to be Used According to the Invention

[0195] The invention also relates to the use of a recombinant vector comprising a nucleic acid coding a THYX polypeptide such as defined hereinabove or a variant of such a polypeptide.

[0196] Advantageously, such a recombinant vector will comprise a nucleic acid selected amongst the following nucleic acids:

[0197] a) a nucleic acid coding a polypeptide having an amino acid sequence selected from the group of sequences SEQ ID N^(o)1 to SEQ ID N^(o)37 or a variant of such a polypeptide, optionally merged with a heterologous polypeptide;

[0198] b) a nucleic acid comprising a polynucleotide selected amongst sequences SEQ ID N^(o)44 to SEQ ID N^(o)64, or a variant of the latter;

[0199] c) a nucleic acid coding a THYX polypeptide having at least 95% amino acid identity with a polypeptide selected amongst the group consisting of the sequences SEQ ID N^(o)1 to SEQ ID N^(o)37 or a variant of the latter;

[0200] d) a nucleic acid having at least 95% nucleotide identity with a nucleic acid selected from the group consisting of the sequences SEQ ID N^(o)44 to SEQ ID N^(o)64 or a variant of the latter.

[0201] It is meant by “vector” herein a circular or linear DNA or RNA molecule being undiscriminately in the form of a single strand or a double strand.

[0202] Are preferred the expression vectors comprising, beside a nucleic acid coding a THYX polypeptide in accordance to the invention, regulatory sequences making it possible to direct the transcription and/or the translation thereof.

[0203] According to an advantageous embodiment, a recombinant vector according to the invention will more particularly comprise the following elements:

[0204] (1) regulatory elements for the expression of the nucleic acid to be inserted, such as promoters and enhancers;

[0205] (2) the coding sequence comprised in the nucleic acid in accordance to the invention to be inserted into such a vector, said coding sequence being arranged in phase with the regulatory signals described in (1); and

[0206] (3) appropriate transcription initiation and stop sequences.

[0207] Moreover, the recombinant vectors according to the invention could comprise one or more replication origins in the cell hosts wherein their expression is being sought, one or more selection markers.

[0208] By way of examples, the bacterial promoters could be the Lacl, LacZ promoters, the promoters of the RNA polymerase of the T3 or T7 bacteriophage, the PR or PL promoters for the lambda phage.

[0209] The eukaryote cell promoters will comprise the promoter of the thymidine kinase of the HSV virus or also the promoter of the mouse's metallothioneine-L.

[0210] Generally, for selecting an adapted promoter, those skilled in the art could advantageously refer to the above-mentioned work by SAMBROOK et al. (1989) as well as to the techniques as disclosed by FULLER et al. (1996).

[0211] Vectors particularly adapted for an expression of the nucleic acids according to the invention in bacteria are, for example, the pQE70, pQE60 or pQE-9 vectors (commercialized by QIAGEN company), the pBluescript, Page script, pNH8A; pNH16a, pNH18a, pNH46A vectors (commercialized by Stratagene corporation), the pKK223-3, pKK233-3, pDR540 and pRIT5 vectors (commercialized by Pharmacia corporation).

[0212] Vectors being particularly adapted for an expression in eukaryote cells are for example the pWLNEO, pSV2CAT, pOG44, pXT1 and pSG vectors (commercialized by Stratagene corporation), the pSVK3, pBPV, pMSG and pSVL vectors (commercialized by Pharmacia corporation).

[0213] A first vector preferably implemented within the scope of the invention is the pcDNA3 vector commercialized by Invitrogen corporation.

[0214] A second particularly preferred vector is the pBluescript SK (−) vector commercialized by Stratagene corporation.

[0215] The preferred bacterial vectors according to the invention are for example the pBR322(ATCC37017) vectors as well as vectors such as pAA223-3 (Pharmacia, Uppsala, Sweden), and pGEM1 (Promega Biotech, Madison, Wis., USA).

[0216] Are also to be mentioned other commercialized vectors such as the psiX174, pBluescript SA, pNH8A, pNH16A, pNH18A, pNH46A, pWLNEO, pSV2CAT, pOG44, pXTI, pSG(Stratagene) vectors.

[0217] They could also be vectors of the baculovirus type such as the pVL1392/1393 vector (Pharmingen) used for transfecting the cells of the Sf9 line (ATCC N^(o)CRL 1711) derived from Spodoptera frugiperda.

[0218] They could also be adenoviral vectors such as human adenovirus of 2 or 5 type.

[0219] A recombinant vector according to the invention could also be a retroviral vector as well as an adeno-associated vector (AAV). Such adeno-associated vectors are for example disclosed by FLOTTE et al. (1992), SAMULSKI et al. (1989), as well as McLAUGHLIN BA et al. (1996).

[0220] Most preferably, the pBAD TOPO vector commercialized by Invitrogen Corporation is implemented to allow for an expression of the thyX gene in E. Coli which is precisely regulated by the presence or the absence of arabinose in the culture medium of the transfected cells with such a recombinant vector, because the pBAD TOPO vector comprises the P_(BAD) promoter being arabinose inducible.

[0221] Examples of Thymidylate Synthase Activity Inhibiting Compounds of a THYX Polypeptide

[0222] Such thymidylate synthase activity inhibiting compounds of a THYX polypeptide are potentially anti-bacterial and/or anti-viral compounds without unwanted effects or with reduced unwanted effects for mammals, including man.

[0223] The growth of the transformants containing thyX in the E.coli X2913(ΔthyA) strain could be inhibited by trimethoprim, which is a specific inhibitor for the dihydrofolate reductase. This fact indicates that the growth depends on the folates.

[0224] Similarly, it is shown according to the invention that the thymidylate synthase activity of the polypeptide coded by the THYX gene of Pyrococcus abyssi is inhibited by a metabolic derivate of 5-fluorouracil, suggesting that the THYX polypeptide interacts with the fluoro-dUMP compound.

[0225] Other compounds able to inhibit the thymidylate synthase activity of a THYX polypeptide are respectively the antisense nucleic acids specifically hybridizing with the messenger RNA coding a THYX polypeptide and the antibodies raised against a THYX polypeptide.

[0226] Antisense Nucleic Acids

[0227] In order to inhibit or to block the expression of a nucleic acid coding a THYX polypeptide, those skilled in the art could use antisense polynucleotides.

[0228] Thus, the invention also relates to using an antisense polynucleotide or oligonucleotide able to specifically hybridize itself with the messenger RNA coding a THYX polypeptide and able to inhibit or to block the transcription and/or the translation thereof. Such a polynucleotide has the general structure being defined in the present description for the probes and the primers according to the invention.

[0229] Preferably, an antisense polynucleotide capable to be used according to the invention comprises a sequence corresponding to a sequence located in the region of the 5′ end of the messenger RNA, and most preferably in the vicinity of the initiation codon of the translation (ATG) of the nucleic acid coding the THYX polypeptide.

[0230] According to a second preferred embodiment, an antisense polynucleotide according to the invention comprises a sequence corresponding to one of the sequences located at the level of the exon/intron junctions of a gene coding the THYX polypeptide and most preferably, sequences corresponding to a splicing site.

[0231] An antisense polynucleotide according to the invention could be prepared from a nucleic acid coding a THYX polypeptide selected amongst the polypeptides with sequences SEQ ID N^(o)1 to SEQ ID N^(o)37.

[0232] An antisense polynucleotide according to the invention could be prepared from a nucleic acid selected amongst the nucleotidic sequences SEQ ID N^(o)44 to SEQ ID N^(o)64.

[0233] Generally, the antisense polynucleotides should have a length and a melting temperature suffificient for allowing to form an intracellular duplex hybrid with a sufficient stability for inhibiting the expression of the mRNA coding the subject THYX polypeptide. Strategies for building up antisense polynucleotides are more particularly disclosed by Green et al. (1986) and Izant and Weintraub (1984).

[0234] Methods for building antisense polynucleotides are also disclosed by Rossi et al. (1991) as well as in the PCT Applications WO 94/23026, WO 95/04141, WO 9218522 and in the European Patent Application EP 0 572 287.

[0235] Advantageously, an antisense polynucleotide according to the invention is 15 to 200 nucleotides long. A sense polynucleotide of the invention has therefore a length varying from 15, 20, 25, 30, 35, 40, 45 or 50 to 75, 100, 150 or 200 nucleotides.

[0236] In order to inhibit or to block the expression of a nucleic acid coding a THYX polypeptide such as defined in the description, one could also simultaneously use a plurality of antisense polynucleotides such as defined hereinabove, each of the antisense polynucleotides hybridizing with a distinct region of the gene or its messenger RNA.

[0237] Other methods for implementing antisense polynucleotides are for example disclosed by Sczakiel et al. (1995) or also disclosed in the PCT Application WO 95/24223.

[0238] Antibodies

[0239] THYX polypeptides such as defined according to the invention, more particularly polypeptides with amino acid sequences SEQ ID N^(o)1 to SEQ ID N^(o)37, or variants thereof as well as the homologous peptides, could be used for preparing antibodies which are able to be selected for the ability to inhibit or to block their thymidylate synthase activity.

[0240] Still another object of the invention is the use of antibodies raised against a THYX polypeptide for inhibiting or blocking the thymidylate synthase activity of such a polypeptide.

[0241] Such antibodies specifically raised against a THYX polypeptide represent a new illustrative example of a thymidylate synthase activity inhibiting compound of a THYX polypeptide according to the invention.

[0242] Such antibodies potentially represent anti-bacterial or anti-viral compounds.

[0243] It is meant herein by “antibody”, more particularly polyclonal or monoclonal antibodies or fragments thereof (for example F (ab)′₂, Fab fragments) or also any polypeptide comprising a domain of the initial antibody recognising the target polypeptide or polypeptide fragment according to the invention.

[0244] Monoclonal antibodies could be prepared from hybridomas according to the technique disclosed by KOHLER and MILSTEIN (1975).

[0245] The present invention is also relative to antibodies raised against a polypeptide such as described hereinabove or a fragment or a variant thereof, such as produced in the trioma technique as well as in the hybridoma technique as disclosed by KOZBOR et al. (1983).

[0246] The invention is also relative to single chain antibody fragments Fv (ScFv) such as disclosed in the U.S. Pat. N^(o) 4,946,778 or also by MARTINEAU et al. (1998).

[0247] The antibodies according to the invention also comprise antibody fragments obtained using phage banks from RIDDER et al., (1995) or also humanized antibodies (REIMANN et al., 1997; LEGER et al., 1997).

[0248] Pharmaceutical Compositions

[0249] Another object of the invention is also an anti-bacterial or anti-viral pharmaceutical composition comprising, as an active principle an antisense oligonucleotide specically hybridizing with a messenger RNA coding a THYX polypeptide such as defined in the present description, in association with one or more physiologically compatible excipients.

[0250] It is also relative to using an antisense oligonucleotide specifically hybridizing with the messenger RNA coding a THYX polypeptide such as defined in the present description for producing an anti-bacterial or an anti-viral drug.

[0251] Such a pharmaceutical composition will preferably comprise antisense oligonucleotide concentrations being at least equimolar with those of the corresponding messenger RNA in the cell.

[0252] Amongst the excipients useful in association with an antisense oligonucleotide such as defined hereinabove, are to be mentioned the synthetic cationic molecules binding to the anionic sites of the antisense oligonucleotide, which aids to the passage of the antisense oligonucleotide through the cell membrane via a non specific endocytosis, including those disclosed by Schofield in 1995 or those disclosed by BEHR in 1994.

[0253] Another useful excipient in association with an antisense oligonucleotide according to the invention is the Lipofectin™ compound, which comprises a 1:1 formulation of the quaternary ammonium compound DOTMA and dioleoylphosphatidylethalolamine, sonicated in the form of small unilamellar vesicles in water.

[0254] Another object of the invention is also an anti-bacterial or an anti-viral pharmaceutical composition comprising as an active principle, an antibody specifically raised against a THYX polypeptide such as defined in the description, in association with one or more physiologically compatible excipients.

[0255] It is also relative to using an antibody specifically raised against a THYX polypeptide such as defined in the description for producing an anti-bacterial or an anti-viral drug.

[0256] The invention is also relative to a method for preventing or for treating a bacterial or a viral disease, said method comprising a step for administrating a therapeutically efficient amount of an antisense oligonucleotide or antibody specific for a THYX polypeptide such as defined hereinabove.

[0257] A pharmaceutical composition according to the invention could be administered by any route, for example through intraveinous, intramuscular, oral or mucosal route, in association with a physiologically compatible carrier and/or adjuvant or excipient.

[0258] An antibody specifically raised against a THYX polypeptide is present in a pharmaceutical composition according to the invention in amounts adapted for a daily administration of 10 nanogrammes to 10 mg of antibody, preferably from 100 nanogrammes to 1 mg and more preferably from 1 pg to 100 pg antibody.

[0259] Techniques for formulating and administrating thymidylate synthase inhibiting compounds of a THYX polypeptide could be found by those skilled in the art in the following work: <<REMINGTON'S PHARMACEUTICAL SCIENCES-MACK publication co., Easton, Pa.>>, in its latest edition.

[0260] Use of Nucleotidic Probes and Primers Hybridizing with a Nucleic Acid Coding a THXY Polypeptide

[0261] As already set forth previously, the thyX genes were found according to the invention in various bacteria and viruses pathogenic in mammals, in particular bacteria pathogenic in man, and viruses.

[0262] Consequently, probes or primers derived from genomic nucleic acids or from the messenger RNA coding a THYX polypeptide are means for detecting the presence of a pathogenic bacterium or virus in a sample, and more specifically a biological sample taken from man or from an animal, for example a sample of saliva, tears, blood, plasma or serum or also from a biopsy sample or a smear.

[0263] The nucleic acids derived from any of the nucleotidic sequences coding a THYX polypeptide such as defined in the description, more particularly the nucleic acids with sequences SEQ ID N^(o)44 to SEQ ID N^(o)64 are useful for detecting the presence of at least one copy of a nucleotidic sequence selected amongst the sequences SEQ ID N^(o)44 to SEQ ID N^(o)64 or also of a fragment or a variant thereof in a sample.

[0264] Preferably, nucleotidic probes or primers according to the invention will have a length of 10, 12, 15, 18 or 20 to 25, 35, 40, 50, 70, 80, 100, 200, 500, 1000, 1500 consecutive nucleotides of a nucleic acid coding a THYX polypeptide or of a nucleic acid with a complementary sequence.

[0265] Alternately, a nucleotidic probe or primer according to the invention will consist in and/or comprise fragments with a length of 12, 15, 18, 20, 25, 35, 40, 50, 100, 200, 500, 1000, 1500 consecutive nucleotides of a nucleic acid coding a THYX polypeptide according to the invention, more particularly a nucleic acid selected amongst the sequences SEQ ID N^(o)44 to SEQ ID N^(o)64, or a nucleic acid with a complementary sequence.

[0266] The definition of a nucleotidic probe and primer according to the invention encompasses oligonucleotides hybridizing, in the strongly stringent hybridization conditions as defined hereinafter, with a nucleic acid coding a THYX polypeptide, in particular a nucleic acid selected amongst the sequences SEQ ID N^(o)44 to SEQ ID N^(o)64 or with a complementary sequence thereof.

[0267] Definition of the Hybridization Conditions

[0268] It is meant by strongly stringent hybridization conditions, as used herein, the following hybridization conditions.

[0269] Prehybridization

[0270] Same conditions as for the hybridization

[0271] duration: 1 night.

[0272] Hybridization

[0273] 5×SSPE (0.9 M NaCl, 50 mM sodium phosphate pH 7.7, 5 mM EDTA)

[0274] 5×Denhardt's (0.2% PVP, 0.2% Ficoll, 0.2% SAB)

[0275] 100 μg/ml DNA of salmon's sperm

[0276] 0.1% SDS

[0277] duration: 1 night.

[0278] Washing Operations

[0279] 2×SSC, 0.1% SDS 10 min 65° C.

[0280] 1×SSC, 0.1% SDS 10 min 65° C.

[0281] 0.5×SSC, 0.1% SDS 10 min 65° C.

[0282] 0.1×SSC, 0.1% SDS 10 min 65° C.

[0283] The parameters defining the stringency conditions depend on the temperature at which 50% of the coupled strands are separated from each other (Tm).

[0284] For the sequences comprising more than 360 bases, Tm is defined by the relationship:

[0285] Tm=81.5+0.41 (% G+C)+16.6 Log(cation concentration)−0.63 (% formamide)−(600/number of bases) (SAMBROOK et al., (1989), pages 9.54-9.62).

[0286] For sequences with a length lower than 30 bases, Tm is defined by the relationship: Tm=4(G+C)+2(A+T).

[0287] Under the appropriate stringency conditions, where the aspecific sequences do not hybridize, the hybridization temperature is approximatively from 5 to 30° C., preferably from 5 to 10° C. below Tm.

[0288] The above described hybridization conditions are implemented for hybridizing a nucleic acid being 200 base long and could be adapted depending on the length of the nucleic acid the hybridization of which is desired, or of the selected marking type, according to the techniques known to those skilled in the art.

[0289] The appropriate hybridization conditions could for example be adapted according to the teaching from the work by HAMES and HIGGINS (1985) or also from the work of AUSUBEL et al. (1989).

[0290] More particularly, it is to be noted that the hybridization level and specificity depend on various parameters such as:

[0291] a) the purity of the preparation of the nucleic acid on which the probe or the primer has to hybridize;

[0292] b) the base composition of the probe or of the primer, the G-C base pairs having a higher thermal stability than the A-T or A-U base pairs;

[0293] c) the length of the homologous base sequence between the probe or the primer and the nucleic acid;

[0294] d) the ionic strength: the hybridization rate increases with the increase of the ionic strength and the incubation time duration;

[0295] e) the incubation temperature;

[0296] f) the concentration of the nucleic acid on which the probe or the primer has to hybridize;

[0297] g) the presence of denaturants, such as agents promoting the break of hydrogene links, such as formamide or urea, increasing the stringency of the hybridization;

[0298] h) the incubation time, the incubation rate increasing with the incubation duration;

[0299] i) the presence of volume excluding agents, such as dextran or dextran sulfate, increasing the hybridization rate as they increase the effective concentrations of the probe and the primer and of the nucleic acid that should hybridize to, within the preparation.

[0300] A nucleotidic primer or probe according to the invention could be prepared using any adapted method well known to those skilled in the art, including through cloning and action of restriction enzymes or also through direct chemical synthesis according to techniques such as the phosphodiester method by NARANG et al. (1979) or by BROWN et al. (1979), the diethylphosphoramidite method by BEAUCAGE et al. (1980) or also the solid substrate technique as disclosed in the EU Patent EP 0,707,592.

[0301] Each of the nucleic acids according to the invention, including the above described oligonucleotidic probes and primers, could be labeled, if desired, by incorporating a marker being detectable by spectroscopic, photochemical, biochemical, immunochemical or also chemical means.

[0302] For example, such markers could comprise radioactive isotopes (³²p, ³³p, ³H, ³⁵S), fluorescent molecules (5-bromodeoxyuridin, fluorescein, acetylaminofluorene, digoxigenin) or also ligands such as biotin.

[0303] Probe labeling preferably occurs by incorporating labeled molecules within polynucleotides through primer extension, or also through addition on the 5′ or 3′ ends.

[0304] The oligonucleotide probes according to the invention could be used, more particularly, in hybridizations of the Southern type with genomic DNA or also in hybridizations with the corresponding messenger RNA when the expression of the corresponding transcript is sought for in a sample.

[0305] The probes according to the invention could also be used for detecting PCR amplification products or also for detecting mismatch pairings.

[0306] Nucleotidic probes and primers according to the invention could be immobilized on a solid substrate. Such solid substrates are well known to those skilled in the art and comprise surfaces of microtitration plate wells, polystyrene beds, magnetic beds, nitrocellulose bands or also microparticles such as latex particles.

[0307] Consequently, another object of the invention is the use of a nucleic probe or primer hybridizing with a nucleic acid coding a THYX polypeptide, such as defined in the description, in a method for detecting a bacterium or a virus, more particularly a bacterium or virus pathogenic in mammals, including man.

[0308] The present invention also relates to a method for detecting the presence of a nucleic acid such as described hereinabove in a sample, said method comprising the steps of:

[0309] 1) contacting one or more nucleotidic probes according to the invention with the sample to be tested;

[0310] 2) detecting the complex optionally formed between the probe(s) and the nucleic acid present in the sample.

[0311] According to a particular embodiment of the detection method according to the invention, the oligonucleotidic probe(s) is/are immobilized on a substrate.

[0312] According to another aspect, the oligonucleotidic probes comprise a detectable marker.

[0313] The invention additionally relates to a set or a kit for detecting the presence of a nucleic acid according to the invention in a sample, said set comprising:

[0314] a) one or more nucleotidic probes such as described hereinuder;

[0315] b) if need be, the reagents required for the hybridization reaction.

[0316] According to a first aspect, the detection set or kit is characterized in that the probe(s) is/are immobilized on a substrate.

[0317] According to a second aspect, the detection set or kit is characterized in that the oligonucleotidic probes comprise a detectable marker.

[0318] According to a particular embodiment of the above described detection kit, such a kit will comprise a plurality of oligonucleotidic probes in accordance with the invention able to be used for detecting target sequences of interest or alternatively for detecting mutations in the coding regions or the non coding regions of the nucleic acids according to the invention, more particularly nucleic acids with sequences SEQ ID N^(o)44 to SEQ ID N^(o)64 or nucleic acids with complementary sequence.

[0319] Thus, the probes according to the invention immobilized on a substrate could be ordered in matrices such as the “DNA chips”. Such ordered matrices have been more particularly described in the U.S. Pat. N^(o) 5,143,854, in the PCT Applications N^(o) WO 90/150 70 and 92/10092.

[0320] Substrate matrices on which oligonucleotidic probes have been immobilized at a high density are for example disclosed in the U.S. Pat. N^(o) 5,412,087 and in the PCT Application WO 95/11995.

[0321] The nucleotidic primers according to the invention could be used for amplifying any one of the nucleic acids according to the invention, and more particularly, all or part of a nucleic acid with sequences SEQ ID N^(o)44 to SEQ ID N^(o)64, or also a variant thereof.

[0322] Another object of the invention relates to a method for amplifying a nucleic acid according to the invention, and more particularly a nucleic acid with sequences SEQ ID N^(o)44 to SEQ ID N^(o)64 or a fragment or a variant thereof contained in a sample, said method comprising the steps of:

[0323] a) contacting the sample wherein the presence of the target nucleic acid is suspected with a pair of nucleotidic primers the hybridization position of which is located respectively on 5′ side and on 3′ side of the region of the target nucleic acid the amplification of which is being sought for, in the presence of the reagents required for the amplification reaction; and

[0324] b) detecting amplified nucleic acids.

[0325] In order to implement the above defined amplification method, one should advantageously use any of the hereinabove described nucleotidic primers.

[0326] Yet another object of the invention is a set or a kit for amplifying a nucleic acid according to the invention, and more particularly all or part of a nucleic acid with sequences SEQ ID N^(o)44 to SEQ ID N^(o)64, said set or kit comprising:

[0327] a) a nucleotidic primer couple in accordance with the invention, the hybridization position of which is located respectively on 5′ side and on 3′ side of the target nucleic acid the amplification of which is being sought;

[0328] b) if need be, the reagents required for the amplification reaction.

[0329] Such an amplification set or kit will advantageously comprise at least one pair of nucleotidic primers such as described hereinabove.

[0330] Use of a Nucleic Acid Coding a THXY Polypeptide as a Selection Marker.

[0331] There is a constant need in the state of the art for novel selection marker genes, particularly in methods aiming at introducing one or more genes of interest into a host organism, for example a host cell.

[0332] The selection marker genes, being carried by the DNA molecule coding the gene(s) of interest which are to be introduced into the host organism or in the host cell, and which are consequently introduced into the host cell simultaneously with the genes of interest, make it possible to select the recombinant host cells.

[0333] A nucleic acid coding a THYX polypeptide, when being used for transfecting auxotrophic host cells for the thymidine synthesis, is an excellent selection marker of transformation, transfection or recombination event of the host cell. Indeed, introducing a nucleic acid coding a THYX polypeptide simultaneously with one or more genes of interest into the host cell restores the autotrophy of the host cell being successfully subjected to the transfection, transformation or recombination.

[0334] In this way, one can select host cells having been subjected to the transfection, transformation or recombination event through medium selection pressure, and more particularly, cultivating the host cells in the absence of thymidine. In such a case, only the host cells having been recombined survive in the absence of thymidine in the culture medium.

[0335] Additionally, a nucleic acid coding a THYX polypeptide is totally non toxic for the environment.

[0336] Vectors comprising a nucleic acid coding a THYX polypeptide as a selection marker gene could be prepared from conventional vectors using techniques well known to those skilled in the art, for example from the preferred vectors according to the invention.

[0337] Still another object of the invention is the use of a nucleic acid coding a THYX polypeptide such as defined in the description as a selecting marker of a genetic transfection, transformation or recombination event of a host cell or a host organism.

[0338] It is also relative to a cloning and/or expression vector comprising a nucleic acid coding a THYX polypeptide such as defined in the description as selection marker genes for a genetic transfection, transformation or recombination event.

[0339] Reaction Media for the Thymidylate Synthase Activity of THXY and their Uses, more Particularly in Screening Methods

[0340] The inventors' work on the ThyX activity made it possible to identify the enzyme mechanism and thereby to optimize the conditions of the reaction by showing the importance of adding some compounds in the reaction medium.

[0341] Another aim of the invention is therefore the use of such media and the application thereof.

[0342] It also relates to a reaction medium for the thymidylate synthase activity of ThyX characterized in that it comprises reduced flavins and CH₂H₄folate.

[0343] More particularly, the reduced flavins of such a medium are obtained through in situ reduction of oxidized flavins. The oxidized flavin concentration is then 50 μM to 1 mM, preferably 0.5 mM. Preferably, the oxidized flavins are flavin mononucleotide (FMN) and/or flavin adenine dinucleotide (FAD). The flavin reduction could occur through chemical, enzymatic, photochemical or electrochemical route, and more particularly, with NADH (β-nicotinamide adenine dinucleotide) and/or NADPH (β-nicotinamide adenine dinucleotide phosphate).

[0344] Moreover, the CH₂H₄folate concentration is 50 μM to 2 mM, preferably 1 mM.

[0345] Preferably, the medium could additionally comprise dUMP (uridine 5′-monophosphate) at a concentration ranging from 1 μM to 800 μM, more preferably 500 μM.

[0346] When flavin reduction is performed with NADH and/or NADPH, the NADH concentration ranges from 0.1 to 1 mM, and the NADPH concentration from 0.5 to 5 mM.

[0347] The invention further aims at screening methods. In particular, an object of the invention is to provide a method for screening an anti-bacterial or an anti-viral compound in vitro in an acellular system characterized in that said method comprises the steps of:

[0348] a) preparing a cell lysate from a cell culture expressing a THYX polypeptide in the absence of a polypeptide coded by a thyA gene and comprising a medium according to the invention;

[0349] b) adding to the cell lysate obtained in step a) the inhibiting compound to be tested;

[0350] c) comparing the thymidylate synthase ThyX activity respectively in the cell lysate obtained in step a) and in the cell lysate obtained in step b); and

[0351] d) selecting the candidate compounds for which an inhibition of the thymidylate synthase ThyX activity has been detected.

[0352] In such a screening method, dTMP and ³H are preferably used as markers of the thymidylate synthase ThyX activity.

[0353] The invention also encompasses kits or sets for screening a thymidylate synthase ThyX inhibiting compound characterized in that it comprises:

[0354] a) a composition comprising a THYX polypeptide as well as a medium according to the invention, in solution or in freeze-dried form;

[0355] b) optionally, one or more reagents required for quantifying the thymidylate synthase ThyX activity. Alternatively, the kit or set for screening an anti-bacterial or an anti-viral compound, could also comprise:

[0356] a) a recombinant expression vector comprising a nucleic acid coding a THYX polypeptide under the control of a functional promoter in a host cell wherein its expression is being sought or a host cell transfected with such a recombinant vector;

[0357] b) a medium according to the invention;

[0358] c) optionally, one or more reagents required for quantifying the thymidylate synthase ThyX activity.

[0359] Other features and advantages of the invention are given by way of illustration in the following examples referring to the figures.

[0360]FIG. 5 illustrates biochemical analyses of the ThyX of H. Pylori.

[0361] (A) 12% SDS-PAGE and immublot analyses of ThyX isolated protein of H. pylori. 1.5 (Coomassie) and 0.3 (anti-V5) μg of pure protein have been detected respectively through coloration with Coomassie Blue or using monoclonal antibodies against an anti-V5 epitope (Invitrogen), through chemiluminescent detection. The expected molecular weight of ThyX of H. pylori is 31.5 kDa

[0362] (B) The spectroscopic analyses of ThyX of H. pylori indicate that ThyX is a flavoprotein. The 10 μM absolute spectrum of isolated enzyme (plain line) and of co-factor after its protein release (dashed line). The window shows the spectrum for the oxidized enzyme and the reduced dithionic protein.

[0363] (C) The release activity of the Tritium of the purified ThyX protein of H. pylon has been recorded in the presence (+MTHF) and in the absence (−MTHF) of CH₂H₄tetrahydrofolate.

[0364]FIG. 6 illustrates the dTMP formation activity of the ThyX protein of H. pylori. The enzyme reactions were performed as described in table 1, using marked dUMP in position 6. The reaction products were analyzed using a reverse phase column C 18 through isocratic elution, using 10 mM of phosphate buffer. The elution durations for dUMP and dTMP were determined using the genuine references.

[0365] The invention is further illustrated, without any limitation, by the following examples.

EXAMPLES Example 1

[0366] Expression of THYX Polypeptide of Helicobacter Pylori in E. Coli

[0367] The DNAs comprising the open reading frame coding the THYX polypeptide of Helicobacter pylori (strain 26.695) (sequence SEQ ID n^(o)21) were obtained through PCR amplification using primers specific for sequences SEQ ID N^(o)38 and SEQ ID N^(o)39 from the GHPEH26 clone publicly available from the AMERICAN TYPE CULTURE COLLECTION under the access number n^(o)628.507.

[0368] The DNA comprising the open reading frame coding the THYX polypeptide of Pyrococcus. abyssi (strain ORSAY) (SEQ ID n^(o)12) was prepared through PCR amplification using primers specific for sequences SEQ ID N^(o)40 and SEQ ID N^(o)41 from the chromosomic DNA of H. pylori (HP 1533).

[0369] The DNA comprising the open reading frame coding the THYX polypeptide of Campylobacter jejuni (strain NCTC 11168) (SEQ ID N^(o)27) was prepared through PCR amplification using primers specific for sequences SEQ ID N^(o)42 and SEQ ID N^(o)43 from the genomic DNA publicly available from the ATCC (American Type Culture Collection) under the access number 7008199.

[0370] The PCR amplification products were cloned in the pBAD TOPO TA vector activated by topoisomerase-l, commercialized by Invitrogen corporation, allowing for a strictly regulated expression of the gene of interest artificially inserted into the vector, in the E. coli bacterium.

[0371] The E. coli clones were characterized through sequencing DNA inserts contained in the pBADTOPOTA vector.

[0372] The theoretical molecular weights of the THYX polypeptides of H. pylori (HP1533), P. abyssi (PAB 0861) and C. jejuni (NCTC 11168) including an amino-terminal translation activator and carboxy-terminal V5 epitopes and histidin, are respectively 31.5 kDa, 33.7 kDa and 28 kDA.

[0373] The results presented in FIG. 2A show that the expression of the THYX polypeptide is induced in the presence of 0.2% of Arabinose in the culture medium, as this was detected using anti-V5 monoclonal antibodies commercialized by Invitrogen Corporation and used according to the manufacturer's recommendations.

[0374] Then the ability of the THYX polypeptides of H.pylori and P.abyssi to allow for the growth of the E.coli X2913 (Δthya572) strain, auxotrophic for thymidine, to multiply in the absence of thymidine has been tested.

[0375] The growth ability of the E.coli strains expressing THYX polypeptides of H. pylori and P. abyssi was determined after culture of the recombinant E. coli cells for 3 to 4 days in the presence or in the absence of a 0.2% concentration of Arabinose on a thymidine free M9 minimum agar medium (Michaels et al., 1990).

[0376] The results are presented in FIG. 2B.

[0377] As can be seen from the results in FIG. 2B, the expression of the thyX gene of H. pylori in the E. coli Chi2193 strain wich was induced by Arabinose, made it possible to complement such an E. coli strain initially deficient in thymidylate synthase activity and to restore the prototrophy of such an E. coli strain for thymidine.

[0378] On the other hand, the thyx gene of P. abyssi, after induction by Arabinose, did not made it possible to complement the E. coli X 2193 strain, reflecting the inability of a hyperthermophilic protein to be functional in a mesophilic host.

[0379] Both recombinant E.coli strains were nevertheless able to multiply in a minimum agar medium in the presence of thymidine at the final concentration of 50 μg/ml.

[0380] The Helicobacter pylori GHPEH26 clone contains a DNA insert of 1.5421 kb corresponding to nucleotides 1613133-1611.613 of the ORF reference AE00511, locus 10, HP 1533, referenced in the data bases such as DNA seq. Acc: AE000 51l.

[0381] The GHPEH26 clone is commercialized by TIGR/ATCC Microbial Genome Special Collection Corporation. The thyX gene of H. pylori obtained through PCR from such a clone represents a 693 pb long DNA fragment.

[0382] The results of electrophoresis SDS PAGE gel illustrated in FIG. 3 show that the E. coli strain transformed with DNA of Campylobacterjejuni produces a THYX polypeptide with the expected molecular weight.

[0383] Additionally, the immuno-imprint results illustrated in FIG. 4 show that the THYX protein of Campylobacterjejuni could be efficiently purified on a Nickel Ni-NTA column.

[0384] The above described results represent the first experimental demonstration that the conversion reaction of dUMP into dTMP could be performed in the cell by a thymidylate synthase other than a thymidylate synthase coded by a thyA gene, i.e. by the thymidylate synthase coded by a thyx gene.

Example 2

[0385] Identification of a THYX Polypeptide Family

[0386] Through the analysis of sequences of about 50,000 genes referenced in the data base of <<clusters>> of orthologous proteins (Tatuson et al., 2000) by a similarity research iterative method (Altschul et al., 1997), it was shown that the THYX sequences, similar to the THY1 sequence of H. pylori (HP 1533), were the only gene family having a mutually exclusive distribution with thyA, with the single exception of Mycobacterium tuberculosis which simultaneously comprises a thyX gene and a thyA gene.

[0387] The similarity iterative research was performed using the PSI-BLAST iterative program (Version 2.0) using various THYA sequences as <<lures>>. Selections (<<hits>>) having an expected value lower than 1.10⁻⁵ were considered as statistically significant. A threshold value for recruiting alignments in the successive iterations was 0.02.

[0388] The non-exhaustive results of the above mentioned protein similarity analysis are shown in Tables 1 and 2 hereinafter. TABLE 1 Comparison comparison (%) Similarity (%) A. aeolicus 26 47 A. pernix 25 38 B. anthracis 30 47 B. burgdorferi 23 38 phi-C31 bacteriophage 25 40 B. lactis 23 42 C. difficile (partial) 40 59 C. diptheria 25 45 C. glutamicum 23 42 C. jejuni 56 72 Chlamydia sp. 22 36 Chlorella virus 20 32 D. discoideum 23 41 D. vulgaris 23 46 G. sulfuureducens 28 48 Gp16 (bacteriophage) 24 39 Gp48 (bacteriophage) 25 44 H. pylori (HP1533) 100 100 H. pylori (jhp1421) 96 97 H. salinarium 22 43 Halobacterium sp. NRC-1 22 43 M. avium 24 41 M. bovis 24 41 M. leprae 21 40 M. tuberculosis 23 39 P. abyssi 23 47 P. furiosus 27 52 P. horikoshii 25 46 R. capsulatus 18 38 R. prowazekii 22 41 Roseophage S101 18 36 S. coelicolor 17 33 S. solfaraticus 26 45 Synechocystis sp. 19 34 T. acidophilum 21 42 T. denticola 28 48 T. maritime 28 46 T. pallidum 22 37 T. volcanium 20 34

[0389] TABLE 2 Species thyA¹ thyX DHFR tdk upp Comments Bacteria: Campylobacter − + − − + Food poisonings Jejuni Helicobacter − + − − − Forming stomach pylori ulcers Rickettsia − + − − − Typhus causal prowazekii agent Borrelia − + − − − Involved in burgdorferi Lyme's disease Treponema − + − − + Syphilis pallidum causal agent Chlamydia − + + − + Compulsory (3 species) intracellular pathogens Mycobacterium + + + − + Tuberculosis tuberculosis Thermotoga − + + + + Thermophilic maritima Archaebacteria Pyrococcus − + − − + Hyper- abyssi thermophilic Pyrococcus − + − − − Hyper- horikoshii thermophilic Eukaryotes: Dictyostelium N.A + N.A N.A N.A No accessible discoideum complete sequence genomic Virus: Bacterial and − + N.A N.A N.A eukaryotic DNA viruses (5 species - which ones?)

[0390] In Table 2 hereinabove, the results are presented of the tests of the presence, more particularly of the thyX and thyA genes in some of the organisms as listed in Table 1.

[0391] The results set forth in Table 1 hereinabove show that the thyX genes are present both in bacteria, in bacteriophages and in viruses. Amongst the eukaryotes, the D.dyscoidum organism is the only organism carrying a copy of a thyX gene.

[0392] The single presence of a thyX gene in numerous bacteria and the eukaryotic DNA virus as well as in numerous bacteria pathogenic in man makes it a preferred target for anti-bacterial or anti-viral compounds which do not interfere with the metabolic route of the thymidylate synthase THYA present in man.

[0393] Additionally, amongst the organisms listed in Table 1 hereinabove many of them do not comprise DHFR genes coding a dihydrofolate reductase, which is required for recycling an essential cofactor of the metabolism of thymidylate, the CH₂H₄-folate.

Example 3

[0394] Site Directed Mutagenesis of the THXY Gene from Helicobacter Pylori

[0395] A. Materials and Methods

[0396] For obtaining the mutants 1 to 6 as described in the <<Results>> Section, the site directed mutagenesis was performed using the <<Quick Change™>> kit commercialized by Stratagene Corporation in accordance with the manufacturer's recommendations.

[0397] For obtaining the mutants 7 and 8 as described in the <<Results>> Section, the site directed mutagenesis was performed using the <<QuickChange™ Multi Site-Directed Mutagenesis Kit>> kit commercialized by Stratagene Corporation in accordance with the manufacturer's recommendations.

[0398] The starting DNA (<<Template>>) being used is the pBAD TOPO plasmid, commercialized by InVitrogen Corporation, wherein there was inserted the thyX gene of Helicobacter pylori 26695.

[0399] The preserved serine 107 residue (<<AGT>> codon) was respectively replaced by cysteine (<<TGC>> codon) or alanine (<<GCT>> codon) residues using the appropriate mutagenic oligonucleotides.

[0400] The Tyrosine 110 residue (<<TAC>> codon) was respectively replaced by threonine (<<ACT>> codon) and phenylalanine (<<TTC>> codon) residues using the appropriate mutagenic oligonucleotides.

[0401] The Glutamate 142 residue (<<GAA>codon) was respectively replaced by Alanine (<<GCT>> codon) and Aspartate (<<GAT>> codon) residues using the appropriate mutagenic oligonucleotides.

[0402] The Histidine 71 residue (<<CAT>> codon) was replaced by the Glutamine residue (<<CAA>> codon) using the appropriate mutagenic oligonucleotides.

[0403] The Glutamate 205 residue (<<GAA>> codon) was replaced by the Leukine (<<TTA>> codon) residue using the appropriate mutagenic oligonucleotides.

[0404] B. Results

[0405] The mutagenesis experiments showed that the E. coli Chi2193 bacteria transformed by the pBAD TOTO TA vectors respectively containing DNA inserts coding the THYX protein wherein the preserved serine amino acid residue was respectively replaced by a cysteine or an alanine residue, expressed a mutated THYX protein containing a flavin.

[0406] On the other hand, both mutant transformed bacteria lost their ability to restore the prototrophy for the thymidine.

[0407] These results show that the preserved serine amino acid residue is essential for the catalytic activity of the THYX protein.

[0408] Additionally, the ability of the thyX gene to complement the bacteria for the thymidylate synthase activity is lost when the Tyrosine 110 and Glutamate 142 residues are mutated.

[0409] A mutation through substitution of Glutamine for Histidine 71 leads to the production of a polypeptide capable to interact with the flavin co-factor, but which does not complement the bacteria for the thymidylate synthase activity.

[0410] Substituting a Leukine residue for the Glutamate 205 residue blocks the production of the THYX polypeptide.

[0411] In additional experiments, the thyX gene of the Rhodobacter capsulatus bacterium was inactivated (through <<knock out>>). The bacteria lacking THYX protein expressed an auxotrophy phenotypic character for thymidine, i.e. they only multiply in the presence of thymidine. These latter results clearly show that the THYX proteins are essential for the thymidine metabolism. Such experiments show that the thyX proteins are essential to the bacterial growth in the absence of thymidine, showing the usefulness of the thyX gene or the THYX polypeptide as a therapeutic target.

Example 5

[0412] Identification of the Reaction Mechanism of THXY

[0413] For identifying the biochemical reactions catalyzed by the ThyX proteins, a ThYX of H. pylori bearing a Histidine marker on its carboxy end was purified from acellular extracts from _(X)2913 strain of E. coli subjected to an induction by arabinose in affinity chromatography of immobilized nickel.

[0414] Open reading frames coding hypothetic proteins now found to correspond to the thyX genes of P. abyssi (PAB0861) and H. pylori (HP1533), were obtained through PCR, using specific primers and chromosomic DNA of P. abyssi and the GHPEH26 clone (American Type Culture Collection n^(o) 628507), respectively, as matrices. The PCR products were cloned in the pBAD TOPO® TA I-activated vector of topoisomerase (Invitrogen), allowing for the strictly controlled expression of the gene in E. coli. All the plasmidic clones were confirmed through DNA sequencing. The expected molecular masses of PAB0861 and HP1533, comprising an amino-terminal translation activating sequence and a carboxy-terminal V5 region and hexahistidin epitopes, are respectively 33.7 and 31.5 kDa. The expression of the protein was induced with 0.2% L-arabinose. The expressed proteins were detected through the use of V5-specific monoclonal antibodies (Invitrogen) in accordance with the manufacturer's recommendations.

[0415] The biologically active ThyX protein with a labeled antigenic site of H. pylori was purified from 200 ml of pGL2/E. coli X2913 [ΔthyA (table 3)] culture after 2 hours induction by 0.2% of L-arabinose. A QIA expression kit (Qiagen) under standard endogenous conditions was used for the purification as indicated by the manufacturer, comprising 10% (volume/volume) glycerol in all the buffers. The resulting protein samples were dialyzed against 50 mM of a phosphate, at pH 7.4 and 10% (volume/volume) glycerol buffer after elution so as to remove imidazole. The protein concentration in pure samples was evaluated through A₂₈₀ reading, justifying the A₂₈₀ absorbency of the flavin co-factor with a non covalent link, at 35560 M⁻¹ cm⁻¹, calculated with respect to the known amino acid sequence, and was used for the ThyX apoprotein of H. pylori.

[0416] The resulting protein preparations (purity>95%) usually contained 1 to 2 mg/ml of protein, with a molecular mass of approximately 31 kDa on SDS-PAGE gels (the ThyX expected molecular mass of H. pylori is 31.5 kDa) (FIG. 5A), and had a light yellow colour. The size exclusion chromatography with Superdex 200 using standard molecular weight markers, showed an endogenous molecular mass of 111 kDa (r=0.9874) for such a protein, suggesting that its active form could correspond to a homotetramer. Spectroscopic analyses of the isolated (oxidized) protein showed absorbency features typical for a flavoprotein (FIG. 5B), with large peaks at 447.5 and 375 nm. Such absorption peaks were found to be absent from the dithionite reduced enzyme. Similar absorption features were found for the co-factor after its release from the protein through denaturation using heat at 80° C. for 5 minutes. Using HPLC chromatography, the co-factor associated to the ThyX of H. pylori was identified as a FAD (flavin-adenine dinucleotide). It has been considered that the various enzyme preparations of ThyX of H. pylori contain 0.4 to 0.5 molecules of FAD per monomer. Globally, such spectroscopic properties indicate that the ThyX of H. pylori is a flavoprotein and/or uses flavin co-factors in the catalysis.

[0417] In dTMP formation, the loss of tritium from [5-H]dUMP in the solvent is a compulsory intermediary, allowing to quantify the thymidylate-synthetase activity after removal of radioactive nucleotides from the reaction mixtures (ROBERTS, 1966). In order to tackle the biochemical mechanism, wherein ThyX could circumvent the requirement for the ThyA in the de novo synthesis of thymydilate, the purified ThyX protein was used to the same extent.

Example 6

[0418] Optimization of the ThyX Reaction Conditions

[0419] N⁵,N¹⁰—CH₂H₄folate was non-enzymatically formed, through incubation of 2 mM of tetrahydrofolic acid (Sigma®) with 100 mM of β-mercaptoethanol and 20 mM of formol for 30 minutes in the dark and at room temperature. The tritium release dosages with a purified enzyme were achieved in 50 mM of Tris-Cl, at pH 7.9, comprising a 1 mM CH₂H₄folate preparation obtained as described hereinabove. The control reactions were achieved under analogous conditions, without tetrahydrofolic acid. The reactions at 50 μl were started through the addition of 6 μM of [5-³H]dUMP, a specific activity at 16,2 Ci/mmol (Amersham) and stopped after 60 minutes at 37° C. by two extractions with 250 μl of active carbon [10% (weight/volume) of Norit A] in 2% trichloroacetic acid so as to remove the nucleotides from the reaction mixtures. The remanent radioactivity in the supernatant was determined according to MYLLYKALLIO (2000).

[0420] As can be seen from FIG. 5C, the ThyX catalyzes in vitro the tritium release from (5-³H)dUMP depending on the protein concentration and in a CH₂H₄folate dependent way, showing the biochemical activity of the ThyX proteins. Optimized reaction conditions are listed in Table 3. TABLE 3 Optimization of reaction conditions for the ThyX protein of H. pylori nmol of released ³H/mg of Test conditions protein (60 minute incubation) ThyX of H. pylori: Complete 63.0 (100%) Complete, −protein  0.7 (1.1%) Complete, −H₄folate  0.8 (1.2%) Complete, −NADH, −NADPH, −FMN  2.2 (3.4%) Complete, −FMN 2.70 (4.3%) Complete, −NADPH 37.3 (59.2%) Complete, −NADH 19.0 (30.1%) Complete, +500 μM Dump  9.3 (14.8%) Complete, +500 μM UMP 64.4 (102.2%) #could be efficiently replaced by FAD (flavin adenine dinucleotide). In such reaction conditions, 20 μM of dUMP and 100 μM of CH₂H₄folate are sufficient for saturating the release activity of tritium of the ThyX protein from H. pylori during 60 minutes incubation.

[0421] Surprisingly, it has been found that the addition of flavin reduced nucleotides drastically increases the tritium release activity of the ThyX from H. pylori (=0.01 μmol of H₂O formed per min and per mg of protein, as measured during additional experiments of absorption time).

[0422] Similarly, it has been shown by means of simple competition experiments that the ThyX activity does not use UMP as a substrate (Table 3) and that such an activity is inhibited by dTMP micromolecular concentrations. The tritium release activity of the ThyX from H. pylori is directly linked to the formation of dTMP (FIG. 6).

[0423] Experimental results therefore clearly showed that the ThyX proteins act as a dUMP dependent thymidilate-synthetase (FIGS. 5, 6).

[0424] Moreover, it has also been shown that the fluoro-dUMP acts as a THYX protein inhibitor. Such results show that the monophosphate deoxynucleotides could be used for identifying new THYX inhibitors.

[0425] For a practical application, those results also show that the tests are also useful for screening THYX proteins.

[0426] Consequently, while the catalysis through ThyX depends on the reduced flavin nucleotides, the ThyA uses H₄folate electrons for forming the methyl functional group. The CH₂H₄folate acts in the reaction catalyzed by ThyX only as a carbon donor, thereby leading to the formation of H₄folate as a reaction product. Such a reaction mechanism clearly explains the reason why dihydrofolate-reductase is not indispensable to the formation of thymidylate by the ThyX proteins (Table 4). TABLE 4 Species thyA DHFR tdk thyX Comments Bacteria: Campylobacter − − − + Food Jejuni poisonings Helicobacter pylori − − − + Formation of stomach ulcers Rickettsia − − − + Typhus causal prowazekii agent Borrelia − − − + Involved in burgdorferi Lyme's disease Treponema pallidum − − − + Syphilis causal agent Chlamydia (3 − + − + Compulsory species) intracellular pathogens Mycobacterium + + − + Tuberculosis tuberculosis Archaebacteria: Pyrococcus abyssi − − − + Hyperthermophilic Pyrococcus − − − + Hyperthermophilic horikoshii Sufolobus − − − + Hyperthermophilic solfaraticus Eukaryotes: Dictyostellium N.A N.A. + No complete genomic sequence Virus: Bacterial and − N.A + eukaryotic DNA viruses (5 species)

[0427] It is to be noted that, in the Chlamydia sequence, the Serine residue is either absent, or is located at the aminoterminal end of the protein.

[0428] The differences in the enzyme mechanism of the two different classes of thymidylate-synthetases are also due to the absence of sequence patterns essential for the catalysis in the ThyA and ThyX proteins. Those data have made it possible to identify analogues of dUMP, dTMP, folate and flavin nucleotides as ideal candidates for key compounds for identifying novel compounds inhibiting the ThyX activity.

[0429] REFERENCES

[0430] ALSTCHUL S F, Madden T L, SCHAFFER M, ZHANG J, ZHANG Z, Miller W. & LIPMAN DJ Nucleic Acids Res. 25, 3389-3402 (1997).

[0431] Beaucage et al., Tetrahedron Lett 1981, 22: 1859-1862 Behr, Bioconjugate Chem., 5, 382-389 (1994)

[0432] Brown E L, Belagaje R, Ryan M J, Khorana H G, Methods Enzymol 1979;68:109-151

[0433] CARRERAS, C. W. and SANTI, D. V., 1995, Annu. Rev. Biochem. vol.64: 721-763.

[0434] DYNES J. L. and FIRTEL R. A., 1989, Proc. Natl. Acad. Sci. USA, vol.86: 7966-7970.

[0435] Fuller S. A. et al., 1996, Immunology in Current Protocols in Molecular Biology, Ausubel et al. Eds, John Wiley & Sons, Inc., USA

[0436] Green et al., Ann. Rev. Biochem. 55:569-597 (1986)

[0437] Houben Weyl, 1974, in Meuthode der Organischen Chemie, E. Wunsch Ed., Volume 15-I et 15-II,

[0438] Izant J G, Weintraub H, Cell 1984 Apr;36(4):1007-15

[0439] Koch Y., 1977, Biochem. Biophys. Res. Commun., 74:488-491

[0440] Kohler G. and Milstein C., 1975, Nature, 256: 495.

[0441] Kozbor et al., 1983, Hybridoma, 2(1):7-16.

[0442] Leger O J, et al., 1997, Hum Antibodies, 8(1): 3-16

[0443] Martineau P, Jones P, Winter G, 1998, J Mol Biol, 280(1):117-127

[0444] Merrifield R B, 1965a, Nature, 207(996): 522-523.

[0445] Merrifield R B, 1965b, Science, 150(693): 178-185.

[0446] Michaels, M. L. Kim, C. W. Matthews D. A. & Miller, J. H. Proc. Natl. Acad. Sci. USA 87, 3957-3961 (1990).

[0447] Myllykallio, H. et al., Science 288, 2212 (2000).

[0448] Narang S A, Hsiung H M, Brousseau R, Methods Enzymol 1979;68:90-98

[0449] PAPAMICKAEL D. 2000, STEM. Cells, vol.18.166-175.

[0450] Reimann K A, et al., 1997, AIDS Res Hum Retroviruses. 13(11): 933-943

[0451] Ridder R, Schmitz R, Legay F, Gram H, 1995, Biotechnology (N Y), 13(3):255-260

[0452] Roberts, D., Biochemistry 5, 3546 (1996).

[0453] Rossi et al., Pharmacol. Ther. 50:245-254, (1991)

[0454] Sambrook, J. Fritsch, E. F. and T. Maniatis. 1989. Molecular cloning: a laboratory manual. 2ed. Cold Spring Harbor Laboratory, Cold spring Harbor, New York.

[0455] Schofield, Brit. Microencapsulated. Bull., 51(1):56-71 (1995)

[0456] Sczakiel G. et al., 1995, Trends Microbiol., 1995, 3(6):213-217

[0457] Carreras C. W. and Santi D. V.,ANNUAL Review on Biochemistry, 1995, 64: 721-762.

[0458] Poole A., Penny D. and Sjöberg B. M._Nature Reviews, Mol. Cell. Biol. 2001, 2: 147-151

[0459] Papamichael D., 1999, The Oncologist, 4(6): 478-487.

[0460] TATUSOV R. L. GALPERIN, M. Y. Natale, D. A. & KOONIN, E. V. Nucleic Acids Res. 28, 33-36 (2000).

1 64 1 265 PRT Borrelia burgdorferi 1 Met Asn Lys Glu Tyr Lys Ile Leu Asp Asn Gly Phe Leu Lys Leu Ile 1 5 10 15 Asp Phe Met Gly Asp Asp Arg Arg Ile Val Lys Ala Ala Arg Ile Ser 20 25 30 Tyr Arg Glu Glu Ser Val Lys Arg Lys Asp Ala Glu Leu Ile Asp Tyr 35 40 45 Leu Ile Arg Asn Gly His Thr Ser Pro Leu Glu Gln Val Val Phe Thr 50 55 60 Phe His Val Lys Ala Pro Ile Phe Val Ala Arg Gln Trp Met Arg His 65 70 75 80 Arg Thr Ala Arg Ile Asn Glu Val Ser Gly Cys Tyr Ser Leu Ala Arg 85 90 95 Glu Glu Phe Tyr Val Pro Leu Glu Glu Asp Leu Lys Cys Gln Thr Ser 100 105 110 Ser Asn Ser Ser Glu Lys Glu Phe Lys Ser Leu Glu Lys Leu Ser Asp 115 120 125 Lys Ile Lys His His Gln Lys His Ser Tyr Glu Leu Tyr Gln Asp Met 130 135 140 Ile Asn Ala Asn Ile Pro Lys Glu Leu Ser Arg Ile Val Leu Pro Leu 145 150 155 160 Ser Leu Tyr Thr Glu Trp Tyr Trp Gln Ile Asp Leu Asn Asn Leu Phe 165 170 175 His Phe Ile Lys Leu Arg Leu Ala Leu Asp Ser Pro Lys Glu Ile Lys 180 185 190 Glu Asn Ser Pro Lys Glu Met Arg Glu Tyr Ala Lys Ala Leu Ile Ser 195 200 205 Ile Val Arg Glu Ile Val Pro Ile Ala Phe Asn Ser Phe Glu Asn His 210 215 220 Phe Leu Arg Gly Lys Arg Phe Ser His Glu Glu Ile Ile Ala Ile Ile 225 230 235 240 Asn Ala Leu Asp Leu Asn Lys Leu Ser Met Asp Ala Glu Lys Leu Asn 245 250 255 Leu Leu Lys Asp Lys Leu Gly Ile Asp 260 265 2 305 PRT Treponema pallidum 2 Met Thr Leu Arg Thr Leu Gln Ala Gly Val Ala Val Ser Ile Ala Leu 1 5 10 15 Asp Arg Val Cys Phe Phe Cys Tyr Asn Gly Ala Val Ala His Cys Val 20 25 30 Val Glu Ala Ala Glu Asp Ile Leu Asp Arg Arg Phe Ser Val Leu Asp 35 40 45 Lys Gly Phe Val Arg Leu Ile Asp Tyr Leu Gly Gly Asp Ala Arg Ile 50 55 60 Val Gln Ala Ala Arg Val Ser Tyr Gly Ala Gly Thr Arg Thr Ala Arg 65 70 75 80 Asp Asp Ala Ala Leu Ile Asp Phe Leu Leu Arg Asn Lys His Thr Ser 85 90 95 Pro Phe Glu Gln Val Val Leu Thr Phe His Val Arg Ala Pro Ile Phe 100 105 110 Val Ala Arg Gln Trp Met Arg His Arg Thr Ala Arg Ile Ser Glu Val 115 120 125 Ser Ser Arg Tyr Ser Leu Leu Ser His Asp Cys Tyr Val Pro Gln Glu 130 135 140 Thr Ser Val Ala Val Gln Ser Thr Arg Asn Lys Gln Gly Arg Ala Ser 145 150 155 160 Glu Gly Ile Ser Pro Glu Gln Gln Gln Glu Val Arg Ala Ala Phe Glu 165 170 175 Ala Gln Gln Lys Ala Ala Cys Ala Ala Tyr Asp Ala Leu Ile Gln Lys 180 185 190 Asn Ile Ala Arg Glu Leu Ala Arg Ile Asn Val Pro Leu Ser Leu Tyr 195 200 205 Thr Glu Trp Tyr Trp Gln Ile Asp Leu His Asn Leu Phe His Phe Leu 210 215 220 Arg Leu Arg Ala Ser Ala His Ala Gln Ala Glu Ile Arg Ala Tyr Ala 225 230 235 240 Glu Val Ile Ile Glu Ile Thr Arg Ala Val Ala Pro Cys Ala Thr Ala 245 250 255 Ser Phe Glu Asn His Glu Lys Asp Gly Val Gln Phe Ser Gly Arg Glu 260 265 270 Phe Ala Ala Leu Lys Ala Leu Leu Ala Gly Glu Gly Leu Ser Leu Glu 275 280 285 Gly Lys Glu Arg Ala Arg Phe Glu Glu Lys Leu Arg Ser Gly Leu Gln 290 295 300 Gln 305 3 294 PRT Rickettsia prowazekii 3 Met His Asn Thr Thr Lys Arg Val Thr Val Pro Ala Leu Glu Ala Met 1 5 10 15 Leu Tyr Glu Thr Ile Lys Val Leu Asp His Gly Phe Ile Arg Val Ile 20 25 30 Asp Tyr Met Gly Asp Asp Ser Ser Ile Val Gln Ala Ala Arg Val Ser 35 40 45 Tyr Gly Lys Gly Thr Lys Gln Leu Asn Gln Asp Lys Gly Leu Ile Asn 50 55 60 Tyr Leu Leu Arg His Tyr His Thr Thr Pro Phe Glu Met Cys Asp Ile 65 70 75 80 Lys Phe His Ile Lys Leu Pro Ile Phe Ile Ala Arg Gln Trp Ile Arg 85 90 95 His Arg Thr Ala Ser Val Asn Glu Tyr Ser Ala Arg Tyr Ser Ile Leu 100 105 110 Gly Asn Glu Phe Tyr Leu Pro Asp Pro Ala Asn Ile Ala Ser Gln Ser 115 120 125 Val Val Asn Lys Gln Cys Arg Ala Gly Asp Ser Val Pro Lys Lys Val 130 135 140 Ser Glu Lys Val Leu Ala Ile Leu Glu Glu Asp Ala Arg Arg Cys Tyr 145 150 155 160 Arg His Tyr Lys Glu Leu Met Asn Ala Asp Glu Asp Gly Asn Ile Leu 165 170 175 Asp Glu Asn Val Ser Gly Ile Ala Arg Glu Leu Ala Arg Ile Asn Leu 180 185 190 Thr Leu Asn Tyr Tyr Thr Glu Trp Tyr Trp Lys Ile Asn Leu His Asn 195 200 205 Leu Leu His Phe Leu Arg Leu Arg Thr Asp Pro Lys Ala Gln Tyr Glu 210 215 220 Ile Arg Val Tyr Ala Glu Lys Ile Leu Asp Ile Val Lys Ala Trp Val 225 230 235 240 Pro Phe Thr Tyr Glu Ala Phe Glu Glu Tyr Arg Leu Gln Gly Ala Asn 245 250 255 Ile Ser Arg Lys Gly Leu Glu Val Ile Lys Arg Met Ile Lys Gly Glu 260 265 270 Lys Val Ile His Glu Thr Ser Gly Met Asn Lys Arg Glu Trp Glu Glu 275 280 285 Leu Val Lys Ile Phe Arg 290 4 220 PRT Thermogata maritima 4 Met Lys Ile Asp Ile Leu Asp Lys Gly Phe Val Glu Leu Val Asp Val 1 5 10 15 Met Gly Asn Asp Leu Ser Ala Val Arg Ala Ala Arg Val Ser Phe Asp 20 25 30 Met Gly Leu Lys Asp Glu Glu Arg Asp Arg His Leu Ile Glu Tyr Leu 35 40 45 Met Lys His Gly His Glu Thr Pro Phe Glu His Ile Val Phe Thr Phe 50 55 60 His Val Lys Ala Pro Ile Phe Val Ala Arg Gln Trp Phe Arg His Arg 65 70 75 80 Ile Ala Ser Tyr Asn Glu Leu Ser Gly Arg Tyr Ser Lys Leu Ser Tyr 85 90 95 Glu Phe Tyr Ile Pro Ser Pro Glu Arg Leu Glu Gly Tyr Lys Thr Thr 100 105 110 Ile Pro Pro Glu Arg Val Thr Glu Lys Ile Ser Glu Ile Val Asp Lys 115 120 125 Ala Tyr Arg Thr Tyr Leu Glu Leu Ile Glu Ser Gly Val Pro Arg Glu 130 135 140 Val Ala Arg Ile Val Leu Pro Leu Asn Leu Tyr Thr Arg Phe Phe Trp 145 150 155 160 Thr Val Asn Ala Arg Ser Leu Met Asn Phe Leu Asn Leu Arg Ala Asp 165 170 175 Ser His Ala Gln Trp Glu Ile Gln Gln Tyr Ala Leu Ala Ile Ala Arg 180 185 190 Ile Phe Lys Glu Lys Cys Pro Trp Thr Phe Glu Ala Phe Leu Lys Tyr 195 200 205 Ala Tyr Lys Gly Asp Ile Leu Lys Glu Val Gln Val 210 215 220 5 260 PRT Dictyostelium discoideum 5 Met Gly Leu Asp Ile Gln Thr Glu Ile Asp Lys Ile Val Ile Glu Lys 1 5 10 15 Val Lys Pro Glu Val Glu Tyr Tyr Asp Val Met Gly Gly Ser His Arg 20 25 30 Trp Glu Val Lys Val His Asp His Gly Lys Val Ala Leu Val Asp Thr 35 40 45 Met Pro Arg Leu Ala Pro Val Gly Gln Thr Ala Asp Phe Ser Ile Cys 50 55 60 Gln Ala Ala Arg Val Ser Tyr Gly Ala Gly Thr Lys Lys Val Thr Glu 65 70 75 80 Asp Lys Gly Leu Ile Arg Tyr Leu Tyr Arg His Gln His Thr Ser Pro 85 90 95 Phe Glu Met Val Glu Phe Lys Phe His Cys Val Met Pro Val Phe Ile 100 105 110 Ala Arg Gln Trp Ile Arg His Arg Thr Ala Asn Val Asn Glu Tyr Ser 115 120 125 Ala Arg Tyr Ser Val Leu Pro Asp Lys Phe Tyr His Pro Ser Ile Glu 130 135 140 Glu Val Arg Lys Gln Ser Thr Ser Asn Arg Gln Gly Gly Glu Glu Ala 145 150 155 160 Leu Glu Pro Lys Thr Ala Gln Glu Phe Leu Asp Tyr Leu Asp Lys Val 165 170 175 Glu Glu Asn Tyr Lys Thr Tyr Asn Glu Leu Leu Glu Lys Gly Leu Ser 180 185 190 Arg Glu Leu Gly Arg Ile Gly Leu Pro Val Ser Ile Tyr Thr Glu Trp 195 200 205 Tyr Trp Lys Ile Asp Leu His Asn Leu Phe His Phe Leu Arg Leu Arg 210 215 220 Met Asp Ser His Ser Gln Lys Glu Ile Arg Asp Tyr Ala Asn Thr Ile 225 230 235 240 Phe Ala Leu Ile Arg Pro Ile Val Pro Val Ala Cys Glu Gly Ile Tyr 245 250 255 Arg Leu Cys Phe 260 6 222 PRT Roseophage 6 Met Thr Gln Ile Glu Ala Thr Tyr Ile Asp His Met Gly Ser Asp Leu 1 5 10 15 Ser Val Val Asn Ala Ala Arg Val Ser Phe Gly Lys Lys Ser Glu Trp 20 25 30 Val Tyr Cys Gly Gln Ser Asp Gly Arg Asp Lys Gly Leu Ser Gly Arg 35 40 45 Asp Thr Lys Leu Ile Lys Tyr Leu Ala Lys His Lys His Ile Ser Pro 50 55 60 Phe Gly His Ala Phe Ala Ser Phe His Val Lys Ala Pro Ile Phe Val 65 70 75 80 Ala Arg Gln Leu Val Lys His Lys Phe Leu Arg Trp Asn Glu Ile Ser 85 90 95 Arg Arg Tyr Val Asp Asp Glu Pro Glu Phe Tyr Thr Pro Asp Val Trp 100 105 110 Arg Gly Arg Ser Ala Asp Lys Lys Gln Gly Ser Asp Gly Val Val Asn 115 120 125 Pro Glu Tyr Asn Pro Gln Tyr Leu Asp Asn Lys Ile Lys Phe Ala Tyr 130 135 140 Leu Gln Ala Leu Asp Ile Gly Ile Ser Pro Glu Gln Ala Arg Met Leu 145 150 155 160 Leu Pro Gln Ser Thr Met Thr Glu Trp Tyr Trp Ser Gly Ser Leu Asp 165 170 175 Ala Phe Ala Asp Met Cys Arg Leu Arg Cys Lys Glu Asp Thr Gln Tyr 180 185 190 Glu Ser Arg Val Val Ala Asp Gln Ile Ser Glu Lys Met Ala Asp Leu 195 200 205 Tyr Pro Val Ser Trp Ala Ala Leu Met Glu Gly Glu Lys Gln 210 215 220 7 246 PRT Streptomyces coelicolor 7 Met Thr Asp Ile Pro Ala Asp Asp Pro Lys Ile Glu Leu Arg Ser Asp 1 5 10 15 Ile Thr Val Glu Leu Val Lys Ser Ala Ala Thr Asp Ser Asp Val Leu 20 25 30 Phe Ala Ala Arg Val Ser Thr Ala Gly Glu Gln Ser Leu Asp Glu Leu 35 40 45 Lys Lys Asp Pro Glu Arg Ser Lys Gly Leu Ile Asn Tyr Leu Met Arg 50 55 60 Asp Arg His Gly Ser Pro Phe Glu His Asn Ser Met Thr Phe Phe Val 65 70 75 80 Ser Ala Pro Ile Phe Val Phe Arg Glu Phe Met Arg His Arg Val Gly 85 90 95 Trp Ser Tyr Asn Glu Glu Ser Gly Arg Tyr Arg Glu Leu Gln Pro Val 100 105 110 Phe Tyr Ala Pro Asp Ala Ser Arg Lys Leu Val Gln Gln Gly Arg Pro 115 120 125 Gly Lys Tyr Val Phe Val Glu Gly Thr Pro Glu Gln His Glu Leu Val 130 135 140 Gly Ser Ala Met Glu Asp Ser Tyr Arg Gln Ala Tyr Ala Thr Tyr Gln 145 150 155 160 Gln Met Leu Ala Ala Gly Val Ala Arg Glu Val Ala Arg Ala Val Leu 165 170 175 Pro Val Gly Leu Tyr Ser Ser Met Tyr Ala Thr Cys Asn Ala Arg Ser 180 185 190 Leu Met His Phe Leu Gly Leu Arg Thr Gln His Glu Leu Ala Lys Val 195 200 205 Pro Ser Phe Pro Gln Arg Glu Ile Glu Met Ala Gly Glu Lys Met Glu 210 215 220 Ala Glu Trp Ala Arg Leu Met Pro Leu Thr His Ala Ala Phe Asn Ala 225 230 235 240 Asn Gly Arg Val Ala Pro 245 8 317 PRT Aquifex aeolicus 8 Met Met Lys Ile Tyr Leu Met Gly Ser Asp Gln Arg Ile Val Arg Cys 1 5 10 15 Ala Arg Val Ser Phe Ala Lys Asp Ser Tyr Val Asp Glu Lys Arg Asp 20 25 30 Lys Arg Leu Ile Arg Tyr Leu Phe Lys His Arg His Ala Ser Pro Phe 35 40 45 Glu His Asn Ile Ile Ala Phe Glu Trp Lys Lys Glu Lys Trp Ile Glu 50 55 60 Leu Leu Ser Lys Leu Glu Asn Pro Thr Val Gln Val Tyr Tyr Ser Asn 65 70 75 80 Gly Phe Val Phe Leu Asn Leu Arg Asn Ala Ile Asn Val Trp Glu Leu 85 90 95 Leu Pro Asp Ala Val Lys Glu Arg Ile Lys Glu Ala Phe Pro Thr Thr 100 105 110 Tyr Gly Val Ile Gln Arg Arg Gly Glu Ile Glu Asp Glu Glu Leu Tyr 115 120 125 Ser Leu Pro Tyr Thr Lys Asp Lys Ala Tyr Val Lys Glu Lys Ile Glu 130 135 140 Thr Ser Ser Gly Trp Ile Gly Leu Val Asp Lys Leu Glu Leu Glu Thr 145 150 155 160 Asp Met Asp Phe Tyr Thr Phe Val Val Glu Cys Pro Leu Phe Val Ala 165 170 175 Arg Gln Trp Met Arg His Arg Phe Gly Ser Tyr Asn Glu Val Ser Lys 180 185 190 Arg Tyr Val Gly Lys Glu Phe Leu Glu Phe Tyr Leu Pro Lys Tyr Ile 195 200 205 Arg Lys Gln Ala Glu Lys Asn Lys Gln Ala Ser Val Asp Glu Pro Ile 210 215 220 Ser Glu Ser Glu Val Phe Ile Lys Lys Ile Glu Asn Leu Ile Ser Lys 225 230 235 240 Ser Val Lys Leu Tyr Glu Glu Ile Ile Glu Lys Gly Gly Ala Lys Glu 245 250 255 Leu Ala Arg Gly Val Leu Pro Gln Phe Met Lys Thr Arg Phe Tyr Trp 260 265 270 Thr Val Pro Arg Ile Ser Leu Asp Asn Phe Ile Thr Leu Arg Thr His 275 280 285 Glu Gly Ala Gln Lys Glu Ile Arg Glu Phe Ala Glu Ala Ile Lys Glu 290 295 300 Met Val Gly Tyr Arg Gly Thr Asp Lys Lys Asn Val Ile 305 310 315 9 216 PRT Chorella virus 9 Met Ser Ala Lys Leu Ile Ser Val Thr Lys Pro Val Val Glu Gly Val 1 5 10 15 Asn Thr Ala Glu Glu Leu Ile Ala Tyr Ala Ala Arg Val Ser Asn Pro 20 25 30 Glu Asn Gln Ile Asn Asn Lys Thr Ala Ser Gly Leu Leu Lys Tyr Cys 35 40 45 Ile Arg His Lys His Trp Ser Ile Phe Glu Thr Ala Phe Met Thr Leu 50 55 60 Glu Leu Lys Thr Ser Arg Gly Ile Ala Ala Gln Val Leu Arg His Arg 65 70 75 80 Ser Phe His Phe Gln Glu Phe Ser Gln Arg Tyr Ala Ser Val Met Glu 85 90 95 Thr Pro Pro Pro His Gln Ala Arg Phe Gln Asp His Lys Asn Arg Gln 100 105 110 Asn Ser Leu Asp Thr Val Pro Glu Asp Asp Gln Thr Trp Trp Ala Thr 115 120 125 Glu Gln Glu Lys Leu Tyr Ala Gln Ser Met Glu Leu Tyr Asn Lys Ala 130 135 140 Leu Glu Lys Gly Ile Ala Lys Glu Cys Ala Arg Phe Ile Leu Pro Leu 145 150 155 160 Ser Thr Pro Thr Thr Ile Tyr Met Ser Gly Thr Ile Arg Asp Trp Ile 165 170 175 His Tyr Ile Glu Leu Arg Thr Ser Asn Gly Thr Gln Arg Glu His Ile 180 185 190 Asp Leu Ala Asn Ala Cys Lys Glu Ile Phe Ile Lys Glu Phe Pro Ser 195 200 205 Ile Ala Lys Ala Leu Asp Trp Val 210 215 10 215 PRT Synechocystis sp. 10 Met Asp Val Arg Phe Ile Ser Leu Thr Lys Pro Glu Ile Val Ile Asp 1 5 10 15 Gly Glu Pro Leu Ser Pro Glu Gly Leu Ile Ala Tyr Cys Ala Arg Val 20 25 30 Ser Ser Pro Asn Gln Glu Asn Pro Asn Tyr Thr Lys Leu Leu Gln Phe 35 40 45 Cys Ile Arg Glu Gly His Trp Ser Ile Phe Glu Met Val Asp Met Thr 50 55 60 Leu Glu Ile Thr Thr Thr Arg Ala Ile Ala Pro Gln Ile Leu Arg His 65 70 75 80 Arg Ser Phe Ser Phe Gln Glu Phe Ser Leu Arg Tyr Ser Cys Ala Thr 85 90 95 Glu Tyr Glu Cys Tyr Glu Ala Arg Arg Gln Asp Val Lys Asn Arg Gln 100 105 110 Asn Ser Leu Asp Asp Phe Asp Glu Ser Thr Lys Lys Trp Phe Asn Gln 115 120 125 Ala Gln Ala Ala Val Trp Glu Lys Ser His Gln Leu Tyr Glu Glu Ala 130 135 140 Leu Ala Lys Gly Ile Ala Lys Glu Cys Ala Arg Ser Ile Leu Pro Leu 145 150 155 160 Asn Thr Val Thr Arg Leu Tyr Met Lys Gly Ser Val Arg Ser Trp Ile 165 170 175 His Tyr Phe Ser Val Arg Cys Asp Gln Ala Thr Gln Lys Glu His Arg 180 185 190 Glu Ile Ala Leu Ala Ala Arg Lys Ile Phe Met Lys His Phe Pro Thr 195 200 205 Val Ala Ala Ala Leu Glu Trp 210 215 11 243 PRT Pyrococcus horikoshii 11 Met Val Lys Val Lys Leu Ile Asn Tyr Thr Pro Lys Pro Leu Glu Thr 1 5 10 15 Val Thr Trp Ala Ala Leu Ile Ser Tyr Trp Asp Gly Trp Ser Thr Glu 20 25 30 Ala Phe Glu Lys Ile Ser Pro Asn Asp Val Glu Ile His Leu Pro Arg 35 40 45 Ile Leu Ser Tyr Gly His Glu Ser Ile Leu Glu His Ala Thr Phe Thr 50 55 60 Phe Ser Ile Glu Gly Cys Ser Arg Val Cys Thr His Gln Leu Val Arg 65 70 75 80 His Arg Ile Ala Ser Tyr Thr Gln Gln Ser Gln Arg Tyr Ile Lys Ile 85 90 95 Asn Pro Glu Asp Val Glu Glu Thr Phe Val Ile Pro Glu Ser Ile Lys 100 105 110 Lys Asp Ser Glu Leu Leu Lys Glu Trp Lys Glu Leu Leu Lys Arg Ser 115 120 125 Leu Glu Leu Tyr Glu Lys Ser Ile Glu Arg Gly Ile His Gln Glu Asp 130 135 140 Ala Arg Phe Ile Leu Pro Gln Ser Val Lys Thr Lys Ile Val Val Thr 145 150 155 160 Met Asn Leu Arg Glu Leu Lys His Phe Phe Gly Leu Arg Leu Cys Glu 165 170 175 Arg Ala Gln Trp Glu Ile Arg Glu Val Ala Trp Lys Met Leu Glu Glu 180 185 190 Ile Ala Lys Arg Lys Glu Leu Lys Pro Ile Ile Glu Trp Ala Lys Leu 195 200 205 Gly Pro Arg Cys Ile Gln Leu Gly Tyr Cys Pro Glu Arg Glu Leu Met 210 215 220 Pro Pro Gly Cys Leu Lys Arg Thr Arg Glu Arg Trp Lys Asn Leu Leu 225 230 235 240 Glu Lys Tyr 12 244 PRT Pyrococcus abyssi 12 Met Val Arg Val Thr Leu Val Asn Tyr Thr Arg Arg Pro Leu Glu Thr 1 5 10 15 Ile Thr Trp Ala Ala Leu Val Ser Tyr Trp Asp Glu Trp Ser Thr Glu 20 25 30 Ser Phe Glu Lys Ile Asn Glu Asp Asp Val Lys Ala His Leu Pro Arg 35 40 45 Ile Leu Gly Tyr Gly His Glu Ser Ile Leu Glu His Ala Thr Phe Thr 50 55 60 Phe Ser Ile Glu Gly Cys Ser Arg Val Cys Thr His Gln Leu Val Arg 65 70 75 80 His Arg Ile Ala Ser Tyr Thr Gln Gln Ser Gln Arg Tyr Ile Val Leu 85 90 95 Asn Glu Glu Asn Val Glu Glu Thr Phe Val Ile Pro Glu Ser Ile Lys 100 105 110 Lys Asp Arg Glu Leu Tyr Glu Lys Trp Lys Lys Ala Met Ala Glu Thr 115 120 125 Ile Lys Leu Tyr Lys Glu Ser Leu Lys Arg Gly Ile His Gln Glu Asp 130 135 140 Ala Arg Phe Ile Leu Pro Gln Ala Val Arg Ser Lys Ile Val Val Thr 145 150 155 160 Met Asn Leu Arg Glu Leu Lys His Phe Phe Gly Leu Arg Leu Cys Glu 165 170 175 Arg Ala Gln Trp Glu Ile Arg Glu Val Ala Trp Lys Met Leu Glu Glu 180 185 190 Ile Ala Lys Arg Glu Glu Leu Arg Pro Ile Ile Lys Trp Ala Lys Leu 195 200 205 Gly Pro Arg Cys Ile Gln Leu Gly Tyr Cys Pro Glu Arg Glu Leu Met 210 215 220 Pro Pro Gly Cys Phe Lys Arg Thr Arg Glu Arg Trp Met Lys Leu Leu 225 230 235 240 Glu Lys Pro Leu 13 266 PRT Halobacterium sp. 13 Met Val Pro Ala Arg Gly Phe Gly Val Phe Leu Pro Pro Ala Gly Thr 1 5 10 15 Pro Ser Ser Met Arg Val Arg Leu Leu Glu Ala Thr Glu Asn Pro Glu 20 25 30 Glu Leu Ile Cys Gln Ser Ala Arg Asn Asp Tyr Met Ser Asp Trp Val 35 40 45 Gly Asp Thr Pro Leu Asp Thr Ala Met Ala Ser Val Asp Gly Asp Thr 50 55 60 Thr Asp Glu Lys Leu Ser Asn Leu Ile Ala Gln Leu Leu Thr Arg Gly 65 70 75 80 His Tyr Gly Pro Phe Glu His Pro Ser Ala Thr Phe Ala Ile Glu Gly 85 90 95 Val Ser Arg Ser Cys Met Ala Gln Leu Thr Arg His Arg His Ala Ser 100 105 110 Phe Asp Val Gln Ser Met Arg Tyr Val Ala Phe Asp Asp Val Asp Pro 115 120 125 Ala Ala Val Ala Glu Gly Glu Leu Val Val Thr Pro Pro Ser Ala Thr 130 135 140 Asp Pro Asp Trp Val Gly Arg Asn Gln Asp Ala Gly Asp Ile Asp Glu 145 150 155 160 Glu Thr Met Ala Glu Arg Glu Ala Val Phe Gln Ala Ser Val Arg Arg 165 170 175 Ala Val Glu Asp Tyr Gln Glu Leu Leu Gly Leu Gly Met Pro Pro Glu 180 185 190 Asp Ala Arg Phe Val Leu Pro Ile Gly Thr Glu Val Asn Val Val Ile 195 200 205 Thr Leu Asn Pro Arg Ser Leu Met His Val Ala Asp Met Arg Ala Ala 210 215 220 Ala Asp Ala Gln Trp Glu Ile Arg Glu Leu Thr Glu Gln Leu Leu Asp 225 230 235 240 Ala Ala Ala Gln Trp Cys Pro His Thr Phe Glu Tyr Tyr Asp Ala Glu 245 250 255 Met Lys His Arg Lys Asn Arg Leu Ala Pro 260 265 14 235 PRT Mycobacteriophage D29 14 Met Lys Val Gln Leu Ile Ala Ser Thr Ile Leu Glu Asp Pro Ser Trp 1 5 10 15 Ala Gly Thr Asp Tyr Val Gly Asp Asp Glu Thr Val Thr Ser Ala Asp 20 25 30 Glu Leu Ala Glu Phe Ala Gly Arg Asn Cys Tyr Leu Ser Phe Asp Arg 35 40 45 Pro Asn Pro Lys Thr Arg Glu Asn Val Asp Tyr Leu Asn His Ile Leu 50 55 60 Asp Val Gly His Glu Ser Val Leu Glu His Ser Ser Ala Thr Phe Tyr 65 70 75 80 Ile Glu Ala Ser Arg Ser Val Leu Thr Glu Leu Glu Arg His Arg His 85 90 95 Leu Ser Phe Ser Val Val Ser Gln Arg Tyr Val Asp Pro Thr Glu Leu 100 105 110 Gly Ile His Val Pro Pro Ala Phe Thr Glu Leu Ser Gly Ser Asp Ala 115 120 125 Asp Lys Ala Lys Glu Val Leu Leu Asp Val Gln Ser Phe Ala Gln Glu 130 135 140 Ala Tyr Glu Tyr Leu Val His Ile Phe Ser Asp Ala Gly Phe Pro Arg 145 150 155 160 Lys Lys Ala Arg Glu Ala Ala Arg Ala Val Leu Pro Asn Met Thr Asn 165 170 175 Ser Pro Met Val Val Thr Gly Asn His Arg Ala Trp Arg Tyr Val Ile 180 185 190 Lys Asn Arg Trp His Glu Ala Ala Asp Ala Glu Ile Arg Glu Leu Ala 195 200 205 Gly Glu Leu Leu Arg Gln Leu Arg Glu Ile Ala Pro Asn Thr Tyr Gln 210 215 220 Asp Ile Pro Thr Glu Pro Tyr Ser Tyr Gly Gly 225 230 235 15 250 PRT Mycobacterium tuberculosis 15 Met Ala Glu Gln Val Lys Leu Ser Val Glu Leu Ile Ala Cys Ser Ser 1 5 10 15 Phe Thr Pro Pro Ala Asp Val Glu Trp Ser Thr Asp Val Glu Gly Ala 20 25 30 Glu Ala Leu Val Glu Phe Ala Gly Arg Ala Cys Tyr Glu Thr Phe Asp 35 40 45 Lys Pro Asn Pro Arg Thr Ala Ser Asn Ala Ala Tyr Leu Arg His Ile 50 55 60 Met Glu Val Gly His Thr Ala Leu Leu Glu His Ala Asn Ala Thr Met 65 70 75 80 Tyr Ile Arg Gly Ile Ser Arg Ser Ala Thr His Glu Leu Val Arg His 85 90 95 Arg His Phe Ser Phe Ser Gln Leu Ser Gln Arg Phe Val His Ser Gly 100 105 110 Glu Ser Glu Val Val Val Pro Thr Leu Ile Asp Glu Asp Pro Gln Leu 115 120 125 Arg Glu Leu Phe Met His Ala Met Asp Glu Ser Arg Phe Ala Phe Asn 130 135 140 Glu Leu Leu Asn Ala Leu Glu Glu Lys Leu Gly Asp Glu Pro Asn Ala 145 150 155 160 Leu Leu Arg Lys Lys Gln Ala Arg Gln Ala Ala Arg Ala Val Leu Pro 165 170 175 Asn Ala Thr Glu Ser Arg Ile Val Val Ser Gly Asn Phe Arg Thr Trp 180 185 190 Arg His Phe Ile Gly Met Arg Ala Ser Glu His Ala Asp Val Glu Ile 195 200 205 Arg Glu Val Ala Val Gly Cys Leu Arg Lys Leu Gln Val Ala Ala Pro 210 215 220 Thr Val Phe Gly Asp Phe Glu Ile Glu Thr Leu Ala Asp Gly Ser Gln 225 230 235 240 Met Ala Thr Ser Pro Tyr Val Met Asp Phe 245 250 16 250 PRT Corynebacterium glutamicum 16 Met Ala Glu Gln Val Lys Leu Ser Val Glu Leu Ile Ala Cys Ser Ser 1 5 10 15 Phe Thr Pro Pro Ala Asp Val Glu Trp Ser Thr Asp Val Glu Gly Ala 20 25 30 Glu Ala Leu Val Glu Phe Ala Gly Arg Ala Cys Tyr Glu Thr Phe Asp 35 40 45 Lys Pro Asn Pro Arg Thr Ala Ser Asn Ala Ala Tyr Leu Arg His Ile 50 55 60 Met Glu Val Gly His Thr Ala Leu Leu Glu His Ala Asn Ala Thr Met 65 70 75 80 Tyr Ile Arg Gly Ile Ser Arg Ser Ala Thr His Glu Leu Val Arg His 85 90 95 Arg His Phe Ser Phe Ser Gln Leu Ser Gln Arg Phe Val His Ser Gly 100 105 110 Glu Ser Glu Val Val Val Pro Thr Leu Ile Asp Glu Asp Pro Gln Leu 115 120 125 Arg Glu Leu Phe Met His Ala Met Asp Glu Ser Arg Phe Ala Phe Asn 130 135 140 Glu Leu Leu Asn Ala Leu Glu Glu Lys Leu Gly Asp Glu Pro Asn Ala 145 150 155 160 Leu Leu Arg Lys Lys Gln Ala Arg Gln Ala Ala Arg Ala Val Leu Pro 165 170 175 Asn Ala Thr Glu Ser Arg Ile Val Val Ser Gly Asn Phe Arg Thr Trp 180 185 190 Arg His Phe Ile Gly Met Arg Ala Ser Glu His Ala Asp Val Glu Ile 195 200 205 Arg Glu Val Ala Val Gly Cys Leu Arg Lys Leu Gln Val Ala Ala Pro 210 215 220 Thr Val Phe Gly Asp Phe Glu Ile Glu Thr Leu Ala Asp Gly Ser Gln 225 230 235 240 Met Ala Thr Ser Pro Tyr Val Met Asp Phe 245 250 17 243 PRT Mycobacteriophage l5 17 Met Lys Ala Lys Leu Ile Ala Ala Thr Glu Ile Asp Pro Gly Ala Leu 1 5 10 15 Arg Asp Ile Gly Phe Glu Val Asp Asp Phe Glu Glu Ser Lys Asp Glu 20 25 30 Asp Pro Tyr Phe Gly Asp Phe Asp Ala Asp Glu Leu Ala Glu Phe Ala 35 40 45 Gly Arg Asn Cys Tyr Arg Ser Phe His Arg Pro Asn Pro Ala Thr Ala 50 55 60 Glu Asn Glu Asp Tyr Leu Asn His Ile Ile Asp Leu Gly His Glu Ser 65 70 75 80 Val Phe Glu His Ala Ser Ala Thr Phe Tyr Ile Glu Ala Ser Arg Ser 85 90 95 Val Leu Thr Glu Leu Glu Arg His Arg His Leu Ser Phe Ser Val Val 100 105 110 Ser Gln Arg Tyr Val Asp Pro Thr Asp Leu Gly Ile His Leu Pro Pro 115 120 125 Ala Leu Phe Lys Leu His Pro Asp Asp Arg Asp Asp Leu Val His Ile 130 135 140 Met Glu Ser Val Ser Ser Glu Ile Asp Ala Val Tyr Glu His Ile Val 145 150 155 160 Asn Arg Leu Ala Asp Arg Gly Leu Pro Arg Lys Gln Ala Arg Glu Ala 165 170 175 Ala Arg Ala Val Leu Pro Asn Met Thr Asn Ser Pro Met Val Val Thr 180 185 190 Gly Asn His Arg Ala Trp Arg Tyr Val Ile Lys Ala Arg Trp His Glu 195 200 205 Ala Ala Asp Ala Glu Ile Arg Glu Leu Ala Gly Glu Leu Leu Arg Gln 210 215 220 Leu Arg Gln Ile Ala Pro Asn Thr Tyr Gln Asp Ile Pro Asp Val Pro 225 230 235 240 Tyr Ser Tyr 18 237 PRT Bacteriophage phi-C31 18 Met Lys Val Asn Val Leu Ala Thr Thr Ala Leu Asn Pro Ser Pro Leu 1 5 10 15 Leu Asp Ala Tyr Glu Tyr Arg Val Ser Gly Ala Ala Tyr Asn Arg Asp 20 25 30 Arg Pro Thr Asp Ala Asp Ala Leu Gly Glu Ala Ala Gly Arg Ile Cys 35 40 45 Tyr Lys Ser Phe Glu Arg Lys Asn Pro Ala Thr Ala Ser Asn Pro Gly 50 55 60 Tyr Leu Gly Asn Ile Leu Ala Gln Gly His Phe Ser Val Leu Glu His 65 70 75 80 Ala Ser Val Thr Phe Leu Val Arg Asp Val Ser Arg Ala Leu Leu Thr 85 90 95 Glu Leu Ser Arg His Arg His Leu Ser Phe Ser Val Val Ser Gln Arg 100 105 110 Tyr Val Asp His Ala Asp Thr Glu Pro Val Val Pro Pro Ala Ile Arg 115 120 125 Gly Thr Glu Leu Glu Lys Pro Phe Arg Glu Asp Tyr Ala Glu Ala Leu 130 135 140 Gln Ala Tyr Asp Ala Gly Val Lys Leu Leu Arg Ala Arg Gly Tyr Gly 145 150 155 160 Arg Lys Gln Ala Arg Glu Ala Ala Arg Ala Leu Leu Pro Asn Ala Ala 165 170 175 Pro Val Asp Met Val Val Thr Gly Asn Leu Arg Ala Trp Arg Asp Val 180 185 190 Leu Gly Lys Arg Trp His Val Ala Ala Asp Ala Glu Ile Arg Glu Phe 195 200 205 Ala Gly Arg Val Leu Asp His Leu His Ala Val Ala Pro Asn Ser Val 210 215 220 Gln Asp Met Pro Thr Ser Pro Phe Gly Ser Asp Gly Lys 225 230 235 19 327 PRT Aeropyrum pernix 19 Met Ser Leu Ala Ala Ser Leu Glu Lys Ala Gly Leu Gly Ile Ser Val 1 5 10 15 Arg Leu Leu Glu Tyr Thr Gly Asp Gly Glu Arg Ile Val Ala Val Ala 20 25 30 Ser Lys Val Ser Leu Ser Arg Ser Pro Ala Glu Arg Leu Leu Ala Ile 35 40 45 Gly Glu Asp Glu Val Glu Thr Trp Ile Leu Glu Thr Phe Arg Arg Gln 50 55 60 His Phe Ser Pro Trp Glu His Ser Val Tyr Thr Phe Met Val Glu Gly 65 70 75 80 Leu Ser Arg Val Ala Ser His Gln Leu Val Arg His Arg Val Ala Ser 85 90 95 Tyr Thr Gln Leu Ser His Arg Tyr Ser Glu Gly Tyr Leu Arg Glu Ala 100 105 110 Ala Leu Lys Ala Cys Glu Ser Ile Gly Leu Asp Cys Pro Ser Lys Pro 115 120 125 Ala Glu Thr Glu Gly Gly Arg Lys Ala Ala Tyr Arg Leu Tyr Ser Gln 130 135 140 Ala Leu Glu Arg Ala Ala Arg Asp Phe Gly Ala Ser Glu Arg Phe Ala 145 150 155 160 Ile Ala Ala Lys Ala Phe Val Ile Pro Pro Thr Ile Leu Ala Arg Gly 165 170 175 Asp Gly Gly Asp Gly Val Val Glu Ala Tyr Leu Arg Ser Ala Ala Ile 180 185 190 Tyr Tyr Ser Leu Leu Ser Arg Gly Ala Arg Arg Glu Asp Ala Arg Tyr 195 200 205 Ile Leu Pro Asp Ala Leu Arg Thr Arg Ile Val Val Thr Met Asn Ala 210 215 220 Arg Glu Leu Ile Gln Val Phe Phe Pro Leu Arg Met Cys Thr Arg Ala 225 230 235 240 Gln Trp Glu Ile Arg His Ile Ala Trp Leu Leu Trp Arg Glu Leu Ser 245 250 255 Arg Val His Pro Arg Leu Phe Arg Trp Ala Gly Pro Ser Cys Val Leu 260 265 270 Arg Glu Asn Thr Leu Arg Thr Thr Pro Ala Ser Leu Tyr Ser Tyr Leu 275 280 285 Glu Gly Val Glu Arg Phe Thr Gln Pro Arg Cys Pro Glu Leu Val Glu 290 295 300 Asn Lys Ala Ile Pro Gly Cys Leu Arg Gln Ala Ala Ser Val Ala Pro 305 310 315 320 Pro Gly Asp Gly Glu Tyr Glu 325 20 269 PRT Treponema denticola 20 His Ser Pro Met Ala His Cys Ile Ala Pro Glu Ala Glu Lys Ile Leu 1 5 10 15 Asp Lys Glu Phe Lys Val Leu Asp Lys Gly Phe Ile Arg Leu Val Asp 20 25 30 Tyr Met Gly Thr Asp Ala Arg Ile Val Gln Ser Ala Arg Val Ser Tyr 35 40 45 Gly Glu Gly Thr Lys Thr Val Arg Glu Asp Ala Ala Leu Ile Asp Tyr 50 55 60 Leu Leu Arg Asn Lys His Thr Ser Pro Phe Glu Gln Val Val Phe Thr 65 70 75 80 Phe His Val Lys Leu Pro Ile Phe Val Ala Arg Gln Trp Ile Arg His 85 90 95 Arg Thr Ala Arg Leu Asn Glu Ile Ser Gly Arg Tyr Ser Ile Leu Lys 100 105 110 Ala Glu Phe Tyr Val Pro Ala Gly Lys Asp Ile Ala Leu Gln Ser Ser 115 120 125 Asp Asn Lys Gln Gly Arg Met Asn Glu Ala Val Pro Gln Asp Leu Gln 130 135 140 Asn Glu Val Ile Thr Ser Leu Gln Lys Gln Gln Glu Glu Ile Tyr Ala 145 150 155 160 Gly Tyr Ser Lys Leu Leu Asp Lys Asn Ile Ala Arg Glu Leu Ala Arg 165 170 175 Ile Asn Leu Pro Leu Ser Thr Tyr Thr Glu Trp Tyr Trp Gln Ile Asp 180 185 190 Leu His Asn Leu Phe His Phe Leu Arg Leu Arg Met Asp Ala His Ala 195 200 205 Gln Lys Glu Ile Arg Asp Tyr Ala Glu Val Met Phe Glu Ile Cys Lys 210 215 220 Thr Val Thr Pro Leu Ala Cys Ala Ser Phe Glu Arg His Glu Lys Asn 225 230 235 240 Gly Val Asn Phe Ser Ala Glu Glu Leu Glu Ala Ile Arg Asn Leu Ile 245 250 255 Ala Gly Lys Asp Ser Gly Leu Lys Gly Lys Glu Leu Glu 260 265 21 231 PRT Helicobacter pylori 21 Met Trp Ile Thr Gln Glu Thr Trp Leu Lys Ala Leu Pro Trp Asn Lys 1 5 10 15 Lys Arg Tyr Arg Ser Gln Ile Met Glu Val Ile Cys Lys His Tyr Thr 20 25 30 Pro Leu Asp Ile Ala Ser Gln Ala Ile Arg Thr Cys Trp Gln Ser Phe 35 40 45 Glu Tyr Ser Asp Asp Gly Gly Cys Lys Asp Lys Glu Leu Ile His Arg 50 55 60 Val Gly Asn Ile Phe Arg His Ser Ser Thr Leu Glu His Leu Tyr Tyr 65 70 75 80 Asn Phe Glu Ile Lys Gly Leu Ser Arg Gly Ala Leu Gln Glu Leu Ser 85 90 95 Arg His Arg Ile Ala Ser Leu Ser Val Lys Ser Ser Arg Tyr Thr Leu 100 105 110 Arg Glu Leu Lys Glu Val Glu Ser Phe Leu Pro Leu Asn Glu Thr Asn 115 120 125 Leu Glu Arg Ala Lys Glu Phe Leu Val Phe Val Asp Asn Glu Lys Val 130 135 140 Asn Ala Met Ser Val Leu Ala Leu Glu Asn Leu Arg Ile Leu Leu Ser 145 150 155 160 Glu His Asn Ile Lys Asn Asp Leu Ala Lys Tyr Ala Met Pro Glu Ser 165 170 175 Tyr Lys Thr His Leu Ala Tyr Ser Ile Asn Ala Arg Ser Leu Gln Asn 180 185 190 Phe Leu Thr Leu Arg Ser Ser Asn Lys Ala Leu Lys Glu Met Gln Asp 195 200 205 Leu Ala Lys Ala Leu Phe Asp Ala Leu Pro Gly Glu His Gln Tyr Leu 210 215 220 Phe Glu Asp Cys Leu Lys His 225 230 22 241 PRT Pyrococcus furiosus 22 Met Val Arg Val Thr Leu Val Asn Tyr Thr Lys Arg Pro Leu Glu Thr 1 5 10 15 Ile Thr Trp Ala Ala Leu Ile Ser Tyr Trp Gly Glu Trp Ser Thr Glu 20 25 30 Ser Phe Glu Arg Ile Ser Glu Asn Asp Val Glu Lys His Leu Pro Arg 35 40 45 Ile Leu Gly Tyr Gly His Glu Ser Ile Leu Glu His Ala Thr Phe Thr 50 55 60 Phe Ser Ile Glu Gly Cys Ser Arg Val Cys Thr His Gln Leu Val Arg 65 70 75 80 His Arg Ile Ala Ser Tyr Thr Gln Gln Ser Gln Arg Tyr Ile Val Leu 85 90 95 Asp Glu Glu Asn Val Glu Glu Thr Phe Val Ile Pro Glu Ser Ile Lys 100 105 110 Lys Asp Arg Glu Leu Tyr Glu Lys Trp Lys Lys Val Met Ala Glu Thr 115 120 125 Ile Ser Leu Tyr Lys Glu Ser Ile Asn Arg Gly Val His Gln Glu Asp 130 135 140 Ala Arg Phe Ile Leu Pro Gln Ala Val Lys Thr Lys Ile Ile Val Thr 145 150 155 160 Met Asn Leu Arg Glu Leu Lys His Phe Phe Gly Leu Arg Leu Cys Glu 165 170 175 Arg Ala Gln Trp Glu Ile Arg Glu Val Ala Trp Lys Met Leu Glu Glu 180 185 190 Met Ala Lys Arg Asp Asp Ile Arg Pro Ile Ile Lys Trp Ala Lys Leu 195 200 205 Gly Pro Arg Cys Ile Gln Phe Gly Tyr Cys Pro Glu Arg Asp Leu Met 210 215 220 Pro Pro Gly Cys Leu Lys Lys Thr Arg Lys Lys Trp Glu Lys Val Ala 225 230 235 240 Glu 23 282 PRT Rhodobacter capsulatus 23 Thr Thr Leu Arg Ala Thr Ser Ala Gly Met Glu Ala Arg Leu Tyr Gln 1 5 10 15 Ala His Pro Val Leu Asp His Gly Leu Ile Arg Val Ile Asp Tyr Met 20 25 30 Gly Asp Asp Ala Ala Ile Cys Gln Ala Ala Arg Val Ser Tyr Gly Arg 35 40 45 Gly Thr Lys Ala Val Ser Asp Asp Arg Gly Leu Ile Arg Tyr Leu Met 50 55 60 Arg His Trp His Ser Thr Pro Phe Glu Met Cys Glu Val Lys Phe His 65 70 75 80 Val Lys Leu Pro Ile Phe Val Ala Arg Gln Trp Ile Arg His Arg Thr 85 90 95 Ala Asn Val Asn Glu Tyr Ser Ala Arg Tyr Ser Val Met Asp Arg Glu 100 105 110 Phe Tyr Ile Pro Ala Pro Glu His Leu Ala Ala Gln Ser Thr Val Asn 115 120 125 Asn Gln Gly Arg Gly Gln Val Leu Glu Gly Ala Glu Ala Ala Arg Val 130 135 140 Leu Asp Leu Leu Arg Glu Asp Ala Met Arg Ala Tyr Asp His Tyr Glu 145 150 155 160 Asp Met Leu Thr Pro Asp Ala Asp Ala Gly Lys Leu Gly Leu Ala Arg 165 170 175 Glu Leu Ala Arg Met Asn Leu Pro Ala Asn Val Tyr Thr Gln Trp Tyr 180 185 190 Trp Lys Ile Asp Leu His Asn Leu Phe His Phe Leu Arg Leu Arg Ala 195 200 205 Asp Val His Ala Gln Tyr Glu Ile Arg Val Tyr Ala Gln Ile Met Gly 210 215 220 Asp Ile Val Lys Asp Trp Val Pro Gln Ala Tyr Glu Ala Phe Glu Asp 225 230 235 240 Tyr Arg Leu Gly Ala Val Ser Val Ser Ala Lys Ala Lys Glu Val Leu 245 250 255 Lys Arg Arg Leu Ala Gly Glu Val Val Thr Ala Glu Thr Ser Gly Met 260 265 270 Ser Lys Gly Glu Trp Arg Glu Phe Val Glu 275 280 24 247 PRT Halobacterium salinarum 24 Met Arg Val Arg Leu Leu Glu Ala Thr Glu Asn Pro Glu Glu Leu Ile 1 5 10 15 Cys Gln Ser Ala Arg Asn Asp Tyr Met Ser Asp Trp Val Gly Asp Thr 20 25 30 Pro Leu Asp Thr Ala Met Ala Ser Val Asp Gly Asp Thr Thr Asp Glu 35 40 45 Lys Leu Ser Asn Leu Ile Ala Gln Leu Leu Thr Arg Gly His Tyr Gly 50 55 60 Pro Phe Glu His Pro Ser Ala Thr Phe Ala Ile Glu Gly Val Ser Arg 65 70 75 80 Ser Cys Met Ala Gln Leu Thr Arg His Arg His Ala Ser Phe Asp Val 85 90 95 Gln Ser Met Arg Tyr Val Ala Phe Asp Asp Val Asp Pro Ala Ala Val 100 105 110 Ala Glu Gly Glu Leu Val Val Thr Pro Pro Ser Ala Thr Asp Pro Asp 115 120 125 Trp Val Gly Arg Asn Gln Asp Ala Gly Asp Ile Asp Glu Glu Thr Met 130 135 140 Ala Glu Arg Gln Ala Val Phe Gln Ala Ser Val Arg Arg Ala Val Glu 145 150 155 160 Asp Tyr Gln Glu Leu Leu Gly Leu Gly Met Pro Pro Val Asp Ala Arg 165 170 175 Phe Val Leu Pro Ile Gly Thr Glu Val Asn Val Val Ile Thr Leu Asn 180 185 190 Pro Arg Ser Leu Met His Val Ala Asp Met Arg Ala Ala Ala Asp Ala 195 200 205 Gln Trp Glu Ile Arg Glu Leu Thr Glu Gln Leu Leu Asp Ala Ala Ala 210 215 220 Gln Trp Cys Pro His Thr Phe Glu Tyr Tyr Asp Ala Glu Met Lys His 225 230 235 240 Arg Lys Asn Arg Leu Ala Pro 245 25 254 PRT Mycobacterium leprae 25 Met Ala Gln Ile Ala Pro Leu Arg Val Gln Leu Ile Ala Lys Thr Glu 1 5 10 15 Phe Leu Ala Pro Pro Asp Val Ser Trp Thr Thr Asp Ala Asp Gly Gly 20 25 30 Ser Ala Leu Val Glu Phe Ala Gly Arg Ala Cys Tyr Gln Ser Trp Ser 35 40 45 Lys Pro Asn Pro Arg Thr Ala Thr Asn Ala Ala Tyr Ile Lys His Ile 50 55 60 Ile Asp Val Gly His Val Ala Val Leu Glu His Ala Ser Val Ser Phe 65 70 75 80 Tyr Ile Ser Gly Ile Ser Arg Ser Cys Thr His Glu Leu Ile Arg His 85 90 95 Arg His Phe Ser Tyr Ser Gln Leu Ser Gln Arg Tyr Val Pro Glu Lys 100 105 110 Asp Ala Gln Val Val Val Pro Pro Asp Met Glu Asp Asp Asp Glu Leu 115 120 125 Gln Gln Ile Leu Ile Ala Ala Val Glu Ala Ser Arg Ala Thr Tyr Thr 130 135 140 Glu Leu Leu Val Lys Leu Asn Ala Lys Leu Met Ala Gly Glu Leu Gly 145 150 155 160 Gly Asn Arg Ala Val Leu Arg Arg Lys Gln Ala Arg Gln Ala Ala His 165 170 175 Ala Val Leu Pro Asn Ala Asn Glu Thr Arg Ile Val Val Thr Gly Asn 180 185 190 Tyr Arg Ala Trp Arg His Phe Ile Ala Met Arg Ala Ser Glu His Ala 195 200 205 Asp Val Glu Ile Arg Arg Leu Ala Ile Val Cys Leu Arg Arg Leu Val 210 215 220 Asp Val Ala Pro Ala Val Phe Ala Asp Phe Glu Ile Thr Ala Leu Ala 225 230 235 240 Asp Gly Thr Glu Val Ala Thr Ser Pro Leu Ala Thr Glu Ala 245 250 26 208 PRT Helicobacter pylori 26 Met Glu Val Ile Cys Lys His Tyr Thr Pro Leu Asp Ile Ala Ser Gln 1 5 10 15 Ala Ile Arg Thr Cys Trp Gln Ser Phe Glu Tyr Ser Asp Asp Gly Gly 20 25 30 Cys Lys Asp Arg Asp Leu Ile His Arg Val Gly Asn Ile Phe Arg His 35 40 45 Ser Ser Thr Leu Glu His Leu Tyr Tyr Asn Phe Glu Ile Lys Gly Leu 50 55 60 Ser Arg Gly Ala Leu Gln Glu Leu Ser Arg His Arg Ile Ala Ser Leu 65 70 75 80 Ser Val Lys Ser Ser Arg Tyr Thr Leu Arg Glu Leu Lys Glu Val Glu 85 90 95 Ser Phe Leu Pro Leu Asn Glu Thr Asn Leu Glu Arg Ala Lys Glu Phe 100 105 110 Leu Val Phe Val Asp Asp Glu Lys Val Asn Glu Met Ser Val Leu Ala 115 120 125 Leu Glu Asn Leu Arg Val Leu Leu Ser Glu His Asn Ile Lys Asn Asp 130 135 140 Leu Ala Lys Tyr Ala Met Pro Glu Ser Tyr Lys Thr His Leu Ala Tyr 145 150 155 160 Ser Ile Asn Ala Arg Ser Leu Gln Asn Leu Leu Thr Leu Arg Ser Ser 165 170 175 Asn Lys Ala Leu Lys Glu Met Gln Asp Leu Ala Lys Ala Leu Phe Asp 180 185 190 Ala Leu Pro Tyr Glu His Gln Tyr Leu Phe Glu Asp Cys Leu Lys His 195 200 205 27 207 PRT Campylobacter jejuni 27 Met Gln Ile Thr Leu Leu Phe His Thr Pro Leu Ser Val Cys Ser His 1 5 10 15 Ala Thr Arg Thr Cys Trp Gln Ser Phe Glu Lys Gly Asp Cys Gly Gly 20 25 30 Glu Lys Asp Lys Glu Leu Ile Asp Arg Val Gly Asn Lys Phe Lys His 35 40 45 Ala Ser Thr Leu Glu His Leu Asn Tyr Thr Phe Tyr Ile Gln Gly Ile 50 55 60 Ser Arg Ala Cys Leu Gln Glu Val Ala Arg His Arg His Thr Ser Pro 65 70 75 80 Ser Val Lys Ser Thr Arg Tyr Thr Leu Lys Glu Leu Arg Asn Glu Ala 85 90 95 Glu Phe Lys Ile Gly Asp Phe Glu Asn Ala Ser Arg Tyr Leu Val Leu 100 105 110 Cys Gly Asn Glu Glu Val Asp Asn Ala Ser Ile Lys Ala Leu Glu Asn 115 120 125 Leu Arg Thr Ile Leu Gln Lys Ser Ile Ser Leu Asp Ile Ala Lys Tyr 130 135 140 Cys Leu Pro Glu Ser Tyr Lys Thr Glu Leu Thr Leu Thr Ile Asn Ala 145 150 155 160 Arg Ser Leu Gln Asn Phe Ile Ser Leu Arg Ser Ser Lys Ser Ala Leu 165 170 175 Trp Glu Ile Arg Asn Leu Ala Asn Ala Leu Phe Glu Ala Leu Pro Gln 180 185 190 Glu His Lys Phe Ile Phe Glu His Cys Leu His Lys Asp Ile Glu 195 200 205 28 260 PRT Sulfolobus solfataricus 28 Met Ile Ser Val Lys Leu Val Ser Tyr Thr Asn Asp Gly Glu Lys Val 1 5 10 15 Ile Ala Ile Ala Ala Lys Met Ser Arg Ser Arg Lys Gly Trp Asp Tyr 20 25 30 His Glu Lys Asp Met Thr Asp Asp Glu Ile Glu Thr Trp Ile Arg Asp 35 40 45 Ala Ile Leu His Gly Tyr Trp Ser Val Leu Glu His Ser Val Tyr Thr 50 55 60 Phe Ser Ile Glu Glu Ile Ser Arg Val Ala Ser His Gln Leu Val Arg 65 70 75 80 His Arg Ile Ala Ser Tyr Thr Gln Met Ser His Arg Phe Ala Lys Pro 85 90 95 Ile Asp Glu Tyr Tyr Lys Pro Ile Ile Pro Pro Ser Ala Lys Lys Arg 100 105 110 Ser Lys Glu Leu Val Glu Lys Ala Tyr Lys Glu Ala Tyr Asp Asn Tyr 115 120 125 Tyr Thr Leu Leu Glu Ser Gly Val Pro Glu Glu Asp Ala Arg Tyr Val 130 135 140 Leu Pro Asn Gly Val Asn Thr Asn Ile Val Val Thr Met Asn Ala Arg 145 150 155 160 Glu Leu Tyr Asn Phe Phe Ser Leu Arg Leu Cys Ser Arg Ala Gln Trp 165 170 175 Glu Ile Arg Ala Ile Ala Trp Lys Met Leu Glu Glu Val Lys Lys Val 180 185 190 His Pro Arg Leu Phe Lys Tyr Thr Gly Pro Asn Cys Ile Ile His Glu 195 200 205 Asn Phe Ile Arg Asn Glu Asn Glu Ser Ile Thr Leu Glu Asp Ile Phe 210 215 220 Lys Asp Tyr Lys Leu Glu Phe Ile Ser Gln Arg Cys Ile Glu Gly Val 225 230 235 240 Leu Arg Asp Gly Ile Lys Lys Cys Ile Ile Asn Ser Arg Ser Val Leu 245 250 255 Asp Asn Ile Lys 260 29 250 PRT Mycobacterium bovis 29 Val Ala Glu Thr Ala Pro Leu Arg Val Gln Leu Ile Ala Lys Thr Asp 1 5 10 15 Phe Leu Ala Pro Pro Asp Val Pro Trp Thr Thr Asp Ala Asp Gly Gly 20 25 30 Pro Ala Leu Val Glu Phe Ala Gly Arg Ala Cys Tyr Gln Ser Trp Ser 35 40 45 Lys Pro Asn Pro Lys Thr Ala Thr Asn Ala Gly Tyr Leu Arg His Ile 50 55 60 Ile Asp Val Gly His Phe Ser Val Leu Glu His Ala Ser Val Ser Phe 65 70 75 80 Tyr Ile Thr Gly Ile Ser Arg Ser Cys Thr His Glu Leu Ile Arg His 85 90 95 Arg His Phe Ser Tyr Ser Gln Leu Ser Gln Arg Tyr Val Pro Glu Lys 100 105 110 Asp Ser Arg Val Val Val Pro Pro Gly Met Glu Asp Asp Ala Asp Leu 115 120 125 Arg His Ile Leu Thr Glu Ala Ala Asp Ala Ala Arg Ala Thr Tyr Ser 130 135 140 Glu Leu Leu Ala Lys Leu Glu Ala Lys Phe Ala Asp Gln Pro Asn Ala 145 150 155 160 Ile Leu Arg Arg Lys Gln Ala Arg Gln Ala Ala Arg Ala Val Leu Pro 165 170 175 Asn Ala Thr Glu Thr Arg Ile Val Val Thr Gly Asn Tyr Arg Ala Trp 180 185 190 Arg His Phe Ile Ala Met Arg Ala Ser Glu His Ala Asp Val Glu Ile 195 200 205 Arg Arg Leu Ala Ile Glu Cys Leu Arg Gln Leu Ala Ala Val Ala Pro 210 215 220 Ala Val Phe Ala Asp Phe Glu Val Thr Thr Leu Ala Asp Gly Thr Glu 225 230 235 240 Val Ala Thr Ser Pro Leu Ala Thr Glu Ala 245 250 30 251 PRT Corynebacterium diphtheriae 30 Met Gln Gln Ser Leu Leu Asp Val Gln Leu Val Ala Cys Ser Thr Phe 1 5 10 15 Thr Thr Pro Ser Gly Val Asp Trp Lys Val Asp Ser Ala Ala Thr Asp 20 25 30 Ser Glu Ala Leu Val Glu Phe Ala Gly Arg Ala Cys Tyr Glu Thr Phe 35 40 45 Asp Lys Pro Asn Pro Arg Thr Ala Ala Asn Asp Ala Tyr Ile Arg His 50 55 60 Ile Met Glu Val Gly His Met Ala Leu Leu Glu His Pro Thr Ala Thr 65 70 75 80 Val Tyr Ile Arg Gly Leu Ser Arg Ser Ala Thr His Glu Leu Val Arg 85 90 95 His Arg His Phe Ser Phe Ser Gln Leu Ser Gln Arg Phe Val His Ser 100 105 110 Asp Glu Thr His Val Val Ile Pro Pro Leu Ile Asp Asn Asp Pro Gln 115 120 125 Leu Arg Glu Leu Phe Leu Ser Thr Val Asp Glu Val Arg Phe Ala Tyr 130 135 140 Ser Glu Leu Met Thr Ala Leu Asp Asn Lys Leu Ala Asp Glu Pro Asn 145 150 155 160 Ala Ile Leu Arg Arg Lys Gln Ala Arg Gln Ala Ala Arg Ser Ile Leu 165 170 175 Pro Asn Ala Thr Glu Ser Arg Ile Val Val Thr Gly Asn Phe Arg Ala 180 185 190 Trp Arg His Phe Ile Gly Met Arg Ala Thr Glu His Ala Asp Val Glu 195 200 205 Ile Arg Ser Leu Ala Val Arg Cys Leu Glu Ile Leu Lys Glu Lys Ala 210 215 220 Pro Thr Val Phe Ser Asp Phe Glu Thr Ser Val Leu Ser Asp Gly Ser 225 230 235 240 Ile Met Ala Thr Ser Pro Tyr Val Thr Asp Tyr 245 250 31 161 PRT Bacillus anthracis 31 Asp Arg Leu Ile Arg His Ile Val Gly Ser Gly His Ala Ser Thr Leu 1 5 10 15 Glu His Leu Thr Tyr Thr Phe Ala Val Glu Gly Val Ser Arg Ala Leu 20 25 30 Leu Ala Gln Leu Thr Arg His Arg Val Gly Phe Ser Tyr Ser Val Gln 35 40 45 Ser Gln Arg Tyr Val Arg Met Gly Ser Asn Asp Lys Ile Gly Gly Phe 50 55 60 Asp Tyr Val Val Pro Glu Thr Val Lys Ala Lys Gly Glu Gln Val Val 65 70 75 80 Asn Ala Tyr Asn Glu Met Met Tyr Lys Leu Gln Ser Gly Tyr Asp Leu 85 90 95 Leu Arg Thr Leu Gly Ile Pro Ala Glu Asp Ala Arg Ser Val Leu Pro 100 105 110 Asn Ala Ala Ala Thr Asn Leu Val Leu Thr Val Asn Leu Arg Gly Leu 115 120 125 Leu Asp Phe Tyr Asn Lys Arg Arg Lys Gly Lys Gly Ala Gln Ala Glu 130 135 140 Ile Ala Glu Leu Ala Glu Gln Leu Arg Gln Glu Val Val Lys Val Glu 145 150 155 160 Lys 32 212 PRT Clostridium difficile 32 Lys Leu Ile Ser His Thr Pro Glu Pro Glu Lys Val Ile Ala Met Ala 1 5 10 15 Ala Lys Leu Cys Tyr Ser Pro Val Gly Thr Asp Glu Ile Glu Lys Asp 20 25 30 Leu Thr Asp Glu Ser Ile Glu Lys Phe Leu Asn Met Leu Leu Ser Ile 35 40 45 Gly His Gly Ser Ile Leu Glu His Ala Ser Phe Thr Phe Ser Ile Glu 50 55 60 Gly Ile Ser Arg Ala Cys Ser His Gln Ile Val Arg His Arg Ile Ala 65 70 75 80 Ser Phe Ser Gln Gln Ser Gln Arg Tyr Val Lys Leu Glu Gln Phe Glu 85 90 95 Tyr Ile Ile Pro Pro Glu Ile Glu Lys Glu Leu Phe Ile Asp Ser Met 100 105 110 Lys Lys Asp Gln Glu Asn Tyr Asp Lys Leu Val Glu Ile Leu Phe Glu 115 120 125 Asn His Tyr Asn Asp Leu Ile Lys Asn Gly Lys Asn Glu Lys Thr Ala 130 135 140 Lys Arg Gln Ala Glu Lys Lys Ala Ile Glu Asp Ala Arg Tyr Val Phe 145 150 155 160 Pro Asn Ala Cys Glu Thr Lys Met Val Phe Thr Ile Asn Ala Arg Ser 165 170 175 Leu Phe Asn Phe Phe Glu His Arg Cys Cys Glu Arg Ala Gln Trp Glu 180 185 190 Ile Arg Asn Leu Ala Val Glu Met Leu Arg Glu Val Lys Lys Val Ala 195 200 205 Pro Ile Leu Phe 210 33 515 PRT Thermoplasma volcanium 33 Met Glu Phe Ser Asn Ala Glu Arg Asp Val Phe Leu Ile Lys Ile Glu 1 5 10 15 Lys Met Ile Asp Arg Gly Ala Leu Met Ser Arg Tyr Ser Arg Ala Ser 20 25 30 Asp Pro Asp Ile Arg Ser Val Tyr Glu Lys Glu Phe Lys Thr Gly Ala 35 40 45 Lys Ser Gly Glu Glu Phe Tyr Arg Arg Ile Phe Leu Glu Tyr Gly Asp 50 55 60 Glu Ser Ile Ala Glu Leu Thr Thr Ala Gln Met Gly Ile Gln Asn Val 65 70 75 80 Ser Asn Val Ala Ser Lys Ile Ile Glu Glu Ile Arg Ile Gly Leu Ser 85 90 95 Tyr Leu Glu Lys Ser Thr Arg Tyr Val Arg Tyr Asp Lys Lys Val Asp 100 105 110 Gly Arg Tyr Leu Tyr Ile Ser Pro Glu Arg Ile Gly Ile Ser Gly Asp 115 120 125 Asp Ala Lys Asp Tyr Val Gln Leu Cys Asp Asn Leu Phe Glu Phe Tyr 130 135 140 Ser Lys Ala Leu Pro Gln Val Glu Asp Tyr Leu Arg His Lys Phe Pro 145 150 155 160 Gln Asp Lys Leu Val Phe Gln Asn Ala Gly Gly Lys Thr Leu Thr Glu 165 170 175 Met Asp Gln Asn Glu Lys Lys Ile Ala Glu Arg Ser Tyr Ile Asn Ala 180 185 190 Val Arg Ser Arg Ala Leu Asp Asp Val Arg Tyr Ile Leu Pro Ala Ser 195 200 205 Thr Leu Thr Asn Ile Gly Ile Ser Gly Asn Gly Arg Ala Leu Ile His 210 215 220 Leu Ile Gln Lys Leu Met Glu Tyr Glu Ile Pro Glu Thr Thr Lys Leu 225 230 235 240 Ala Lys Asp Ile Tyr Asp Glu Leu Lys Pro Glu Leu Pro Gln Leu Ile 245 250 255 Asp Asp Ala Leu Ser Gly His Gly Leu Glu Ile Ile Asn Phe Lys Lys 260 265 270 Asn Leu Met Gly Leu Phe Pro Tyr Asp Leu Thr Gly Asn Phe Glu Arg 275 280 285 Ile Arg Leu Leu Ser Tyr Gly Glu Glu Asp Lys Glu Leu Arg Lys Val 290 295 300 Ala Ser Leu Ile Glu Tyr Pro Phe His Gly Asp Ala Ala Ser Leu Tyr 305 310 315 320 Ser Arg Ser Ser Glu Ser Tyr Val Lys Tyr Met Lys Glu Leu Ile Glu 325 330 335 Ser Ile Arg Ser Leu Arg Ala Asn Arg Arg Met Lys Pro Gly Arg Ala 340 345 350 Phe Glu Ser Val Asn Tyr Val Phe Glu Leu Asn Leu Asn Tyr Gly Ser 355 360 365 Phe Arg Asp Leu Gln Arg His Arg Phe Leu Gly Ile Ile Arg Lys Pro 370 375 380 Leu Thr Ala Ala Tyr Gly Tyr Asp Thr Pro Pro Val Ile Ser Ala Ile 385 390 395 400 Asp Glu Leu Lys Thr Gln Tyr Asp Glu Leu Met Ala Asn Ser Ser Ser 405 410 415 Phe Tyr Gln Arg Leu Arg Glu Lys Tyr Gly Pro Trp Ile Ser Gln Tyr 420 425 430 Val Val Pro Phe Ala Phe Lys Tyr Pro Ile Thr Phe Ser Thr Asn Leu 435 440 445 Ser Glu Val Thr Tyr Phe Val Glu Leu Arg Ser Thr Ala Gln Ala His 450 455 460 Phe Asp Leu Arg Asp Ile Ala Val Ser Met Tyr Arg Glu Val Ser Lys 465 470 475 480 Val His Pro Thr Leu Ser Arg Ile Ile Lys Phe Val Asp Thr Ala Asp 485 490 495 Tyr Pro Leu Gly Arg Leu Ser Ala Glu Phe Arg Lys Glu Ser Lys Lys 500 505 510 Ala Gly Ile 515 34 499 PRT Thermoplasma acidophilum 34 Met Ile Asp Arg Gly Ala Leu Met Ser Arg Tyr Ser Arg Ala Ser Asp 1 5 10 15 Pro Asp Ile Arg Ser Val Phe His Arg Glu Phe Glu Gly Asn Gln Lys 20 25 30 Arg Ser Glu Asp Phe Tyr Arg Arg Ile Phe Leu Glu Tyr Gly Asp Glu 35 40 45 Ser Ile Ala Glu Leu Val Thr Ala Gln Val Gly Ile Gln Asn Val Ser 50 55 60 Asn Val Ile Ser Lys Val Ile Glu Glu Ile Arg Ile Gly Leu Ser Tyr 65 70 75 80 Leu Glu Lys Ser Thr Arg Tyr Val Ala Tyr Asp Arg Lys Val Asp Gly 85 90 95 His Tyr Leu Phe Met Gln Ala Glu Lys Ile Gly Leu Ser Gly Glu Ala 100 105 110 Ala Arg Glu Tyr Thr Asp Leu Cys Asn Arg Leu Phe Asp Leu Tyr Ser 115 120 125 Ser Thr Leu Pro Arg Ile Glu Glu Glu Ile Ser Arg Gln Trp Pro Ile 130 135 140 Glu Ser Phe Asp Phe Asn Ile Asp Gly Asn Pro Arg Asn Tyr Lys Glu 145 150 155 160 Leu Asp Glu Asn Gly Arg Lys Leu Ala Gln Lys Ser Tyr Arg Ser Ser 165 170 175 Val Arg Ser Arg Ala Leu Asp Asp Ala Arg Phe Ile Leu Pro Ala Ser 180 185 190 Thr Leu Thr Asn Met Gly Val Ser Gly Asn Gly Arg Ser Phe Ile His 195 200 205 Leu Ile Gln Lys Leu Met Glu Tyr Gly Val Pro Glu Ser Glu Arg Leu 210 215 220 Ala His Asp Leu Tyr Glu Glu Leu Lys Gly Glu Phe Pro Gln Ile Ile 225 230 235 240 Asp Asp Ala Leu Ser Gln His Gly Gln Asp Ile Ile Asn Tyr Lys Arg 245 250 255 Ser Leu Ala Ser Leu Phe Pro Tyr Thr Asp Gly Gly Arg Phe Glu Lys 260 265 270 Val Arg Leu Ile Lys Tyr Ser Asn Glu Arg Glu Glu Met Gln Lys Val 275 280 285 Leu Ala Leu Leu Met Tyr Pro Phe Ala Glu Asp Ala Ser Gly Ile Ile 290 295 300 Ser Arg Ile Lys Ala Met Glu Leu Ser Glu Ala Ser Ala Ile Leu Glu 305 310 315 320 Arg Ile Arg Asp Leu Arg Lys Asn Arg Arg Met Lys Val Gly Arg Pro 325 330 335 Phe Glu Ala Val Asn Tyr Val Phe Glu Val Thr Thr Asn Tyr Gly Ala 340 345 350 Phe Arg Asp Leu Gln Arg His Arg Phe Leu Ser Ile Val Arg Lys Pro 355 360 365 Leu Thr Val Ser Tyr Gly Phe Asp Val Pro Pro Ile Ile Ala Lys Met 370 375 380 Pro Asp Leu Ser Glu Glu Tyr Ala Glu Ala Met Lys Asp Ala Glu Arg 385 390 395 400 Val Tyr Arg Ile Ile Lys Glu Arg Tyr Gly Ala Trp Ile Ala Gln Tyr 405 410 415 Ala Val Pro Phe Ala Tyr Arg Tyr Pro Val Val Phe Thr Thr Asn Leu 420 425 430 Ala Glu Ala Thr Tyr Phe Ile Glu Leu Arg Ser Thr Pro Gln Ala His 435 440 445 Phe Asp Leu Arg Asp Ile Ala Ile Arg Met Tyr Asn Glu Ile Lys Ser 450 455 460 Val His Pro Ser Leu Ala Gly Ile Ile Lys Phe Val Asp Thr Gly Asp 465 470 475 480 Tyr Pro Leu Gly Arg Leu Ser Ala Glu Val Arg Lys Asn Val Lys Ala 485 490 495 Gly Gly Ile 35 531 PRT Chlamydophila pneumoniae 35 Met Leu Gly Lys Glu Glu Glu Phe Thr Cys Lys Gln Lys Gln Cys Leu 1 5 10 15 Ser His Phe Val Thr Asn Leu Thr Ser Asp Val Phe Ala Leu Lys Asn 20 25 30 Leu Pro Glu Val Val Lys Gly Ala Leu Phe Ser Lys Tyr Ser Arg Ser 35 40 45 Val Leu Gly Leu Arg Ala Leu Leu Leu Lys Glu Phe Leu Ser Asn Glu 50 55 60 Glu Asp Gly Asp Val Cys Asp Glu Ala Tyr Asp Phe Glu Thr Asp Val 65 70 75 80 Gln Lys Ala Ala Asp Phe Tyr Gln Arg Val Leu Asp Asn Phe Gly Asp 85 90 95 Asp Ser Val Gly Glu Leu Gly Gly Ala His Leu Ala Met Glu Asn Val 100 105 110 Ser Ile Leu Ala Ala Lys Val Leu Glu Asp Ala Arg Ile Gly Gly Ser 115 120 125 Pro Leu Glu Lys Ser Thr Arg Tyr Val Tyr Phe Asp Gln Lys Val Arg 130 135 140 Gly Glu Tyr Leu Tyr Tyr Arg Asp Pro Ile Leu Met Thr Ser Ala Phe 145 150 155 160 Lys Asp Met Phe Leu Gly Thr Cys Asp Phe Leu Phe Asp Thr Tyr Ser 165 170 175 Ala Leu Ile Pro Gln Val Arg Ala Tyr Phe Glu Lys Leu Tyr Pro Lys 180 185 190 Asp Ser Lys Thr Pro Ala Ser Ala Tyr Ala Thr Ser Leu Arg Ala Lys 195 200 205 Val Leu Asp Cys Ile Arg Gly Leu Leu Pro Ala Ala Thr Leu Thr Asn 210 215 220 Leu Gly Phe Phe Gly Asn Gly Arg Phe Trp Gln Asn Leu Ile His Lys 225 230 235 240 Leu Gln Gly His Asn Leu Ala Glu Leu Arg Arg Leu Gly Asp Glu Ser 245 250 255 Leu Thr Glu Leu Met Lys Val Ile Pro Ser Phe Val Ser Arg Ala Glu 260 265 270 Pro His His His His His Gln Ala Met Met Gln Tyr Arg Arg Ala Leu 275 280 285 Lys Glu Gln Leu Lys Gly Leu Ala Glu Gln Ala Thr Phe Ser Glu Glu 290 295 300 Met Ser Ser Ser Pro Ser Val Gln Leu Val Tyr Gly Asp Pro Asp Gly 305 310 315 320 Ile Tyr Lys Val Ala Ala Gly Phe Leu Phe Pro Tyr Ser Asn Arg Ser 325 330 335 Leu Thr Asp Leu Ile Asp Tyr Cys Lys Lys Met Pro His Glu Asp Leu 340 345 350 Val Gln Ile Leu Glu Ser Ser Val Ser Ala Arg Glu Asn Arg Arg His 355 360 365 Lys Ser Pro Arg Gly Leu Glu Cys Val Glu Phe Gly Phe Asp Ile Leu 370 375 380 Ala Asp Phe Gly Ala Tyr Arg Asp Leu Gln Arg His Arg Thr Leu Thr 385 390 395 400 Gln Glu Arg Gln Leu Leu Ser Thr His His Gly Tyr Asn Phe Pro Val 405 410 415 Glu Leu Leu Asp Thr Pro Met Glu Lys Ser Tyr Arg Glu Ala Met Glu 420 425 430 Arg Ala Asn Glu Thr Tyr Asn Glu Ile Val Gln Glu Phe Pro Glu Glu 435 440 445 Ala Gln Tyr Met Val Pro Met Ala Tyr Asn Ile Arg Trp Phe Phe His 450 455 460 Val Asn Ala Arg Ala Leu Gln Trp Ile Cys Glu Leu Arg Ser Gln Pro 465 470 475 480 Gln Gly His Gln Asn Tyr Arg Thr Ile Ala Thr Gly Leu Val Arg Glu 485 490 495 Val Val Lys Phe Asn Pro Met Tyr Glu Leu Phe Phe Lys Phe Val Asp 500 505 510 Tyr Ser Asp Ile Asp Leu Gly Arg Leu Asn Gln Glu Met Arg Lys Glu 515 520 525 Pro Thr Thr 530 36 529 PRT Chlamydia trachomatis 36 Met Leu Ser Lys Glu Gly Gly Phe Ser Glu Glu Gln Arg Ala Arg Leu 1 5 10 15 Ser His Phe Val Thr Asn Leu Asp Ser Pro Ile Phe Ala Leu Lys Asn 20 25 30 Leu Pro Glu Val Val Lys Gly Ala Leu Phe Ser Lys Tyr Ser Arg Ser 35 40 45 Thr Leu Gly Leu Arg Ala Leu Leu Leu Lys Glu Phe Leu Asp Gly Glu 50 55 60 Gly Gly Asn Phe Leu Asp Asp Asp Gln Gln Asp Cys Glu Leu Gly Ile 65 70 75 80 Gln Lys Ala Ala Asp Phe Tyr Arg Arg Val Leu Asp Asn Phe Gly Asp 85 90 95 Asp Ser Val Gly Glu Leu Gly Gly Ala His Leu Ala Leu Glu Gln Val 100 105 110 Ser Met Leu Ala Ala Lys Ile Leu Glu Asp Ala Arg Ile Gly Gly Ser 115 120 125 Pro Leu Glu Lys Ser Ser Arg Tyr Val Tyr Phe Asp Gln Lys Val Asn 130 135 140 Gly Glu Tyr Leu Tyr Tyr Arg Asp Pro Ile Leu Met Thr Ser Ala Phe 145 150 155 160 Lys Asp Val Phe Leu Asp Thr Cys Asp Phe Leu Phe Asn Thr Tyr Ser 165 170 175 Asp Leu Ile Pro Gln Val Arg Ser His Phe Glu Lys Leu Tyr Pro Lys 180 185 190 Asp Pro Glu Val Ser Gln Ser Ala Tyr Thr Val Ser Leu Arg Ala Lys 195 200 205 Val Leu Asp Cys Leu Arg Gly Leu Leu Pro Ala Ala Thr Leu Thr Asn 210 215 220 Leu Gly Phe Phe Gly Asn Gly Arg Phe Trp Gln Asn Leu Leu His Arg 225 230 235 240 Leu Gln Asp Asn Ser Leu Val Glu Val Arg Asn Ile Gly Glu Gln Ser 245 250 255 Leu Thr Glu Leu Met Lys Ile Ile Pro Ser Phe Val Ser Arg Ala Glu 260 265 270 Ser His His Tyr His His Gln Ala Met Val Asp Tyr Arg Arg Ala Leu 275 280 285 Lys Glu Gln Leu Lys Ser Phe Ala His Arg Tyr Gly Glu Glu Arg Glu 290 295 300 Ile Ser Lys Glu Ala Gly Val Lys Leu Val Tyr Gly Asp Pro Asp Gly 305 310 315 320 Leu Tyr Lys Ile Ala Ala Ala Tyr Met Phe Pro Tyr Ser Glu His Thr 325 330 335 Tyr Ala Glu Leu Leu Asp Ile Cys Arg Asn Ile Pro Asn Glu Asp Leu 340 345 350 Met Arg Ile Leu Glu Ser Gly Ala Ser Phe Arg Glu Asn Arg Arg His 355 360 365 Lys Ser Pro Arg Gly Leu Glu Cys Ala Glu Phe Ala Phe Asp Ile Thr 370 375 380 Ala Asp Phe Gly Ala Tyr Arg Asp Leu Gln Arg His Arg Ile Leu Thr 385 390 395 400 Gln Glu Arg Gln Leu Leu Thr Thr Lys Leu Gly Tyr Thr Met Pro Ser 405 410 415 Gln Leu Ile Asp Thr Pro Met Glu Ala Pro Phe Arg Glu Ala Met Glu 420 425 430 Lys Ala Asp Gln Ala Tyr Arg Leu Ile Ala Glu Glu Phe Pro Glu Glu 435 440 445 Ala Gln Tyr Val Val Pro Leu Ala Tyr Asn Ile Arg Trp Leu Phe His 450 455 460 Ile Asn Ala Arg Gly Leu Gln Trp Leu Cys Glu Leu Arg Ser Gln Pro 465 470 475 480 Gln Gly His Glu Ser Tyr Arg Lys Ile Ala Ile Asp Met Ala Arg Glu 485 490 495 Val Ile Gln Phe His Pro Ala Tyr Glu Leu Phe Leu Lys Phe Val Asp 500 505 510 Tyr Ser Glu Thr Asp Leu Gly Arg Leu Gln Gln Glu Ser Arg Lys Lys 515 520 525 Ser 37 529 PRT Chlamydia muridarum 37 Met Leu Ser Lys Glu Gly Asp Phe Ser Lys Glu Gln Arg Glu Arg Leu 1 5 10 15 Ser His Phe Val Thr Asn Leu Asp Ser Pro Ile Phe Ala Leu Lys Asn 20 25 30 Leu Pro Glu Val Val Lys Gly Ala Leu Phe Ser Lys Tyr Ser Arg Ser 35 40 45 Ile Leu Gly Leu Arg Ala Leu Leu Leu Lys Glu Phe Leu Asp Gly Glu 50 55 60 Gly Gly Asp Phe Leu Ser Glu Asp Leu Gln Asp Cys Glu Leu Gly Ile 65 70 75 80 Gln Lys Ala Ala Asp Phe Tyr Arg Arg Val Leu Asp Asn Phe Gly Asp 85 90 95 Asp Ser Val Gly Glu Leu Gly Gly Ala His Leu Ala Ile Glu Gln Val 100 105 110 Ser Met Leu Ala Ala Lys Val Leu Glu Asp Ala Arg Ile Gly Gly Ser 115 120 125 Pro Leu Glu Lys Ser Ser Arg Tyr Val Tyr Phe Asp Gln Lys Val Asn 130 135 140 Gly Glu Tyr Leu Tyr Tyr Arg Asp Pro Ile Leu Met Thr Ser Ala Phe 145 150 155 160 Lys Asp Thr Phe Leu Asp Thr Cys Asp Phe Leu Phe Asn Thr Tyr Ser 165 170 175 Glu Leu Ile Pro Gln Val Arg Ala Tyr Phe Glu Lys Ile Tyr Pro Lys 180 185 190 Asp Pro Glu Val Ser Gln Ser Ala Tyr Thr Val Ser Leu Arg Ala Lys 195 200 205 Val Leu Asp Cys Leu Arg Gly Leu Leu Pro Ala Ala Thr Leu Thr Asn 210 215 220 Leu Gly Phe Phe Gly Asn Gly Arg Phe Trp Gln Asn Leu Leu His Arg 225 230 235 240 Leu Gln Asp Asn Asn Leu Val Glu Val Arg Asn Ile Gly Glu Gln Ala 245 250 255 Leu Thr Glu Leu Met Lys Ile Ile Pro Ser Phe Val Ser Arg Ala Glu 260 265 270 Pro His His His His His Gln Ala Met Val Asp Tyr His Leu Gly Leu 275 280 285 Arg Glu Gln Leu Lys Ser Phe Ala Gln Arg Tyr Gly Glu Glu Arg Glu 290 295 300 Pro Ser Leu Glu Lys Gly Val Lys Leu Val Tyr Gly Asp Pro Asp Gly 305 310 315 320 Leu Tyr Lys Ile Ala Ala Ala Ser Met Phe Pro Tyr Ser Glu His Thr 325 330 335 Tyr Ala Asp Leu Leu Asp Ile Cys Arg Lys Ile Pro Asp Glu Asp Leu 340 345 350 Met Leu Ile Leu Glu Ser Ser Ala Ser Ser Arg Glu Asn Arg Arg His 355 360 365 Lys Ser Pro Arg Gly Leu Glu Cys Ala Glu Phe Ala Phe Asp Ile Thr 370 375 380 Ala Asp Phe Gly Ala Tyr Arg Asp Leu Gln Arg His Arg Ile Leu Thr 385 390 395 400 Gln Glu Arg Gln Leu Leu Thr Thr Lys Leu Gly Tyr Ser Ile Pro Gln 405 410 415 Gln Leu Leu Asp Thr Pro Met Glu Ala Pro Phe Arg Glu Ala Met Glu 420 425 430 Lys Ala Asp Gln Ala Tyr Arg Leu Ile Ala Ala Glu Phe Pro Glu Glu 435 440 445 Ala Gln Tyr Val Val Pro Leu Ala Tyr Asn Ile Arg Trp Leu Phe His 450 455 460 Ile Asn Thr Arg Gly Leu Gln Trp Leu Cys Glu Leu Arg Ser Gln Pro 465 470 475 480 Gln Gly His Glu Ser Tyr Arg Gln Ile Ala Ile Asp Met Ala Lys Glu 485 490 495 Val Ile Gln Phe His Pro Ala Tyr Lys Ser Phe Leu Lys Phe Val Asp 500 505 510 Tyr Ser Glu Thr Asp Leu Gly Arg Leu Lys Gln Glu Ser Arg Arg Lys 515 520 525 Ala 38 24 DNA Artificial sequence Description of artificial sequence primer 38 gccatgggga tcacccaaga aact 24 39 24 DNA Artificial sequence Description of artificial sequence primer 39 atgcttcaaa caatcttcaa acaa 24 40 24 DNA Artificial sequence Description of artificial sequence primer 40 gccatggttc gagttacgct cgtc 24 41 24 DNA Artificial sequence Description of artificial sequence primer 41 taagggtttt tcgagaagtt tcat 24 42 24 DNA Artificial sequence Description of artificial sequence primer 42 gccatggaaa tcactttact tttt 24 43 24 DNA Artificial sequence Description of artificial sequence primer 43 ttctatatct ttatgtaggc aatg 24 44 693 DNA Helicobacter pylori 44 atgtggatca cccaagaaac ttggctaaaa gcgttaccgt ggaataaaaa gcgatatagg 60 agtcaaataa tggaagtgat ttgtaagcac tatacccctt tagacattgc gagccaagcg 120 atccgcactt gctggcagag ctttgaatac agcgatgatg gaggctgtaa ggataaagaa 180 ttgatccaca gggtggggaa tatttttagg cattcttcca ctttagagca tctttattac 240 aattttgaaa tcaagggttt gagcaggggg gcgttgcaag aattgagccg gcatagaata 300 gcgagcttga gcgtgaaatc aagccgttac actttgaggg aattgaaaga agtggagagc 360 tttttacccc ttaatgaaac gaatttagaa agagcgaaag agtttttagt ttttgtggat 420 aatgaaaaag tgaatgcaat gagcgtttta gctttggaaa atctaaggat cttattgagc 480 gagcataaca ttaaaaacga tttagccaaa tacgccatgc ctgaaagcta taaaacgcat 540 ttggcctata gtattaacgc taggagcttg caaaatttct tgactttaag aagcagtaat 600 aaagccttaa aagaaatgca agatttagcc aaagccttat ttgacgcttt acctggcgag 660 catcagtatt tgtttgaaga ttgtttgaag cat 693 45 624 DNA Helicobacter pylori 45 atggaagtga tttgtaagca ttacacccct ttagacattg cgagccaagc gatccgcact 60 tgctggcaga gttttgaata cagcgatgat ggaggttgta aggataggga tttgatccac 120 agggtgggga atattttcag gcattcttcc actttagagc atctttatta caattttgaa 180 atcaagggtt tgagtagggg ggcgttgcaa gaattgagcc gacatcgaat agcgagcttg 240 agcgtgaaat caagccgtta cactttaagg gaattgaaag aagtggagag ctttttaccc 300 cttaatgaaa cgaatttaga aagagctaaa gagtttttag tttttgtaga tgatgaaaaa 360 gtgaatgaaa tgagcgtttt agctttggaa aatctcaggg ttttattgag cgagcataac 420 attaaaaacg atttagccaa atacgccatg cctgaaagct ataaaacgca tttagcctat 480 agcattaacg ctaggagctt gcaaaattta ttgactttaa gaagcagcaa taaagcctta 540 aaagaaatgc aagatttagc caaagcctta tttgacgcct taccttatga gcatcaatat 600 ttgtttgagg attgtttgaa gcac 624 46 621 DNA Campylobacter jejuni 46 atgcaaatca ctttactttt tcacactcca ttatctgttt gttctcatgc aacaagaact 60 tgttggcaaa gctttgaaaa aggcgattgc ggtggcgaaa aagataaaga acttatcgat 120 cgcgtgggta ataaattcaa acacgcttca accttagaac atctaaacta tactttttat 180 atacaaggaa tttcaagagc ttgcttacaa gaagttgcaa gacaccgtca cactagtcct 240 agtgtaaaaa gcacgcgtta taccttaaaa gaacttcgca atgaagctga atttaaaata 300 ggagattttg aaaatgcaag ccgttatctt gtgctttgtg gtaatgaaga ggttgataac 360 gcaagcatta aagccttaga aaatttacgc actattttac aaaaaagcat tagtctagat 420 atagccaaat actgcttacc agaaagctat aaaaccgaac ttacactaac gattaatgca 480 agaagtttac aaaattttat atctttgcgt agttcaaaat cagctctttg ggagattaga 540 aatttagcaa atgctttatt tgaagcctta ccacaagaac acaaatttat atttgaacat 600 tgcctacata aagatataga a 621 47 882 DNA Rickettsia prowazekii 47 atgcacaata cgacaaaaag agtaacagta ccagcacttg aagcaatgtt atatgagact 60 atcaaagttt tagatcatgg ttttattagg gtaatcgatt atatgggtga tgatagttcg 120 atagtacagg cagctcgtgt ttcttatggt aaaggtacta agcagttaaa ccaagataaa 180 ggattaataa attacttgct gcgtcattat catacgacac cttttgagat gtgtgacatc 240 aaatttcaca ttaaattacc tatatttatt gcacggcagt ggattaggca taggactgct 300 agtgttaacg aatattcagc aaggtattct attttaggaa atgaatttta tttacctgac 360 cctgccaata ttgcttctca atctgttgta aataaacaat gtagagcagg ggatagcgta 420 ccgaaaaaag tatctgaaaa agttctcgca attttagaag aagatgctag acggtgctac 480 aggcattata aggagctcat gaatgctgat gaagatggaa atattctaga tgagaatgtt 540 tcaggcatag caagagaact tgctcgtatt aatttaactt tgaattatta tacggaatgg 600 tattggaaga ttaatttaca taatttgctt cattttttaa gattacgaac tgatcctaag 660 gcacaatatg aaattagagt ttatgccgaa aagatacttg acatagtaaa agcttgggta 720 ccttttactt acgaagcgtt tgaagagtat cgtttgcaag gcgcgaatat ttcacgtaaa 780 ggtttagagg taattaaaag aatgataaaa ggtgaaaaag tgatccatga aactagtggt 840 atgaataaaa gagaatggga ggaattggta aagattttta gg 882 48 660 DNA Thermotoga maritima 48 atgaagatcg atatcttgga caaaggattc gttgaacttg tggatgtgat gggaaacgac 60 ctctccgctg tacgggctgc ccgcgtttct ttcgatatgg gtttgaaaga cgaagaaagg 120 gatagacatc tcatcgaata tctcatgaaa cacggtcacg agacaccttt cgaacatatt 180 gtcttcactt tccacgtgaa agcccctata ttcgtggcga ggcagtggtt cagacacagg 240 atcgcttctt acaacgaact gagcggcaga tactcgaagc tctcctacga attctacatt 300 ccctctcccg aacgcctgga agggtacaaa acgaccatcc ctcccgaacg ggtgacggag 360 aagatctcag aaatagtcga taaagcgtat cgaacgtatc tggagttgat agaaagcggt 420 gttcctcgag aagtggcaag gatagtgctc cctttgaatc tgtacacgag gtttttctgg 480 actgtgaacg caagaagtct catgaacttt ttgaacctga gggcagattc tcacgctcag 540 tgggaaattc aacagtacgc tctggcaatc gcgagaattt tcaaagagaa gtgtccctgg 600 acttttgagg catttcttaa gtatgcttat aaaggagata tcctgaagga ggtacaggta 660 49 729 DNA Pyrococcus horikoshii 49 atggttaagg taaaactgat aaattacact cccaaaccct tggagacagt tacttgggct 60 gcacttataa gctattggga tggttggagt accgaagctt ttgagaagat atcccctaat 120 gatgttgaaa ttcacctacc tagaatccta agctacggac acgaatcaat cctggagcat 180 gcaactttca cattctccat agagggatgc tcaagggtgt gcacgcatca gctcgttagg 240 cataggatag caagttacac ccaacagagt caaaggtaca ttaaaataaa ccctgaagat 300 gttgaggaaa ccttcgtcat accggagtcc ataaaaaaag attcggagtt attaaaagag 360 tggaaagagc tacttaagag atctttagaa ttgtacgaga aaagcattga aagggggata 420 catcaagagg atgcaagatt catccttccc caatcagtta aaacgaaaat tgtagtaacg 480 atgaacctaa gggaactaaa gcacttcttt ggattgagat tatgcgagag ggcccaatgg 540 gagatcaggg aagttgcttg gaagatgctt gaggaaattg ccaaaaggaa agagctcaag 600 ccgataattg agtgggctaa gctagggcca agatgcatcc aacttggcta ctgtccagag 660 agggaactta tgccaccagg ttgcttaaaa agaacgaggg aaaggtggaa aaacttattg 720 gaaaagtat 729 50 750 DNA Mycobacterium tuberculosis 50 gtggccgaga ccgcgccgct gcgcgtgcaa ctgatcgcca agaccgactt cttggcccca 60 cccgacgtgc cctggaccac cgacgccgac ggcggacccg cgctggtcga gttcgccggc 120 cgggcctgct atcagagctg gtccaagccc aatcccaaga ccgccaccaa cgccggctac 180 ctccggcaca tcatcgacgt cggacatttc tcggtgctag agcatgccag cgtgtcgttc 240 tacatcaccg ggatctcgcg atcgtgcacc cacgagctga tccgccaccg gcatttctcc 300 tactcgcagc tctcccagcg ctacgtaccc gagaaggact cgcgggtcgt cgtgccgccc 360 ggcatggagg acgacgccga cctgcgccac atcctgaccg aggccgccga cgccgcccgc 420 gccacctaca gcgagctgct ggccaagctg gaagccaagt tcgccgacca acccaacgcg 480 atcctgcgcc gcaagcaggc ccgccaagcc gcccgcgcgg tgctgcccaa cgccaccgaa 540 acccgcatcg tggtgaccgg caactaccgg gcctggcggc acttcatcgc aatgcgggcc 600 agcgagcacg ccgacgtgga aatccggcga ctggccatcg aatgcctgcg ccagctcgcc 660 gccgtggccc ccgcggtgtt cgccgacttc gaggtgacca ccctggccga cggcaccgag 720 gtggcgacca gcccgttggc gaccgaagcc 750 51 750 DNA Mycobacterium tuberculosis 51 gtggccgaga ccgcgccgct gcgcgtgcaa ctgatcgcca agaccgactt cttggcccca 60 cccgacgtgc cctggaccac cgacgccgac ggcggacccg cgctggtcga gttcgccggc 120 cgggcctgct atcagagctg gtccaagccc aatcccaaga ccgccaccaa cgccggctac 180 ctccggcaca tcatcgacgt cggacatttc tcggtgctag agcatgccag cgtgtcgttc 240 tacatcaccg ggatctcgcg atcgtgcacc cacgagctga tccgccaccg gcatttctcc 300 tactcgcagc tctcccagcg ctacgtaccc gagaaggact cgcgggtcgt cgtgccgccc 360 ggcatggagg acgacgccga cctgcgccac atcctgaccg aggccgccga cgccgcccgc 420 gccacctaca gcgagctgct ggccaagctg gaagccaagt tcgccgacca acccaacgcg 480 atcctgcgcc gcaagcaggc ccgccaagcc gcccgcgcgg tgctgcccaa cgccaccgaa 540 acccgcatcg tggtgaccgg caactaccgg gcctggcggc acttcatcgc aatgcgggcc 600 agcgagcacg ccgacgtgga aatccggcga ctggccatcg aatgcctgcg ccagctcgcc 660 gccgtggccc ccgcggtgtt cgccgacttc gaggtgacca ccctggccga cggcaccgag 720 gtggcgacca gcccgttggc gaccgaagcc 750 52 762 DNA Mycobacterium leprae 52 gtggcccaga tcgcgccgct acgcgtgcaa ctaattgcca agactgagtt tctggcgcct 60 cctgacgtgt cgtggactac cgacgccgac ggtggttccg cgctggtcga attcgccggt 120 cgcgcctgtt atcagagctg gtcgaagccc aatccccgga ccgcgaccaa cgccgcgtac 180 attaaacaca tcattgacgt cgggcatgtt gcggtgctcg agcatgccag cgtttcgttc 240 tatatcagcg gcatctcgcg atcatgcact cacgagctga tccgacatcg gcatttctcc 300 tactctcagc tgtcgcagcg ctacgtgccg gaaaaggatg cccaggtcgt tgtgccaccg 360 gacatggagg atgacgacga acttcaacag attctgattg cggccgtaga ggccagccgg 420 gccacctaca ctgaactgct ggtcaagctg aacgccaagt taatggccgg tgagctcggc 480 gggaataggg cggtgttgcg gcgcaagcaa gctcgccaag ctgcccacgc ggtgctgccc 540 aacgccaacg agacccggat cgttgtgacc gggaactacc gggcatggcg gcacttcatc 600 gccatgcggg ccagcgagca cgccgacgtc gaaatccggc ggctggccat tgtctgcctg 660 cgccggctcg tcgacgttgc gcccgcagta ttcgctgatt tcgagatcac cgcacttgct 720 gatggtactg aggtcgcaac tagtccttta gccaccgaag ct 762 53 780 DNA Sulfolobus solfataricus 53 atgatctctg ttaaattagt ctcatatacg aatgatggag agaaagtcat tgctattgct 60 gctaagatga gtaggagtag aaaaggttgg gattatcatg agaaagacat gactgatgat 120 gaaattgaga catggatcag agatgcaatc cttcacggtt attggagtgt tcttgagcat 180 agcgtttata ctttttctat agaagaaatt tctcgtgtag cttcacatca gcttgtgagg 240 cataggattg caagttatac gcaaatgagt cataggtttg ctaagcctat agacgaatac 300 tacaaaccaa ttattccgcc atctgccaag aaaagaagta aagagttagt ggagaaagcg 360 tataaggaag cctatgacaa ctattacaca cttttagaaa gcggtgtacc agaggaagat 420 gcaagatatg tactaccaaa tggagtaaat acaaatattg tcgtcacaat gaatgctaga 480 gagttatata acttcttctc gcttcgactg tgttcaaggg cacagtggga gattagagca 540 atagcttgga aaatgttaga ggaggttaag aaagttcatc cccgactatt caaatatact 600 ggaccaaatt gtataataca tgaaaacttc ataagaaatg aaaatgaatc tataacttta 660 gaagatatat ttaaagatta caaattagag ttcatatctc aaagatgcat cgagggagta 720 ttgagagacg gaataaaaaa atgcatcata aattcgcgtt ctgtattgga taatattaaa 780 54 951 DNA Aquifex aeolicus 54 ttgatgaaaa tctacttaat gggctcggac cagaggatag ttaggtgtgc gagggtgtcc 60 tttgcaaagg actcatatgt cgatgaaaag agggacaaaa gactcattcg ctacctcttc 120 aagcacaggc acgcgtctcc ctttgagcac aacatcattg cctttgagtg gaaaaaggag 180 aagtggatag aactcctttc aaagctggaa aaccctaccg ttcaggttta ctactcaaat 240 ggctttgttt ttcttaactt aaggaacgct atcaacgtat gggaactttt gcccgacgca 300 gtcaaagaaa ggataaagga ggcttttcct accacttacg gagtaattca gcggagaggg 360 gaaattgagg atgaggaact ttattccctt ccctacacta aagataaagc ttacgtaaag 420 gaaaagattg agacgagttc tggctggatc gggcttgttg ataagctgga actggaaact 480 gatatggatt tttacacttt tgtggttgaa tgccctcttt ttgtagcccg tcagtggatg 540 aggcacaggt tcggctctta caacgaggtt tctaaaaggt acgtgggaaa ggagtttctg 600 gaattttatc tccctaaata cataagaaag caagcggaga aaaacaaaca ggcttctgtg 660 gatgaaccta tttctgagag tgaagttttt ataaaaaaaa tagaaaattt gataagcaaa 720 agcgtaaagc tctacgagga gataatagaa aagggcggtg caaaagagct cgcaagggga 780 gtgcttcccc agtttatgaa aacgaggttt tactggaccg ttccgaggat ttctcttgac 840 aacttcataa ccctcagaac tcacgaaggt gcacaaaaag agataaggga atttgcagaa 900 gctataaagg aaatggtagg atacagggga actgataaga agaacgtcat t 951 55 981 DNA Aeropyrum pernix 55 gtgtccttgg cagcgtccct ggaaaaagct ggcctcggca tctcggtcag gctactcgag 60 tatactggtg atggagagcg tattgtcgct gttgctagta aggtaagcct ctcccgcagc 120 ccggcggaga ggctgctggc cataggggaa gatgaagttg agacttggat actcgagacg 180 tttagaaggc agcatttcag cccctgggag cacagcgtct acacattcat ggttgagggt 240 ctctcgaggg tggctagcca ccagctagtt agacataggg ttgctagcta cacacaacta 300 agccataggt atagcgaggg ctacctcagg gaggccgccc tcaaggcctg cgaatccatc 360 ggcctagact gtccatcgaa accggctgag acggagggcg ggcggaaagc cgcatacagg 420 ctgtactctc aggctctgga gagagctgct cgagatttcg gggctagcga gaggtttgct 480 atagcggcga aggccttcgt aataccgcct accattctgg cgaggggtga tggcggtgat 540 ggtgtcgtgg aggcgtatct acggtctgca gcgatatact atagcctact ctcccggggg 600 gctaggcgtg aggatgcgcg gtatatcctg cccgatgcgc tgagaaccag aatcgttgtt 660 actatgaatg cacgggagct tatacaggtc ttcttccccc tgaggatgtg cactagagcc 720 cagtgggaga tacgccacat agcctggctg ctatggcgag agctatccag ggttcatcca 780 aggctgttca ggtgggccgg gcccagctgt gtgttaaggg agaacaccct taggacgacg 840 cccgccagct tatacagcta cctggagggc gtagagcggt ttacacagcc ccgctgccct 900 gagcttgttg agaacaaggc tataccggga tgcctcaggc aggcggcatc ggtggcgccc 960 cccggagacg gggagtacga a 981 56 798 DNA Halobacterium sp. 56 atggtaccgg cgcgtgggtt cggagtgttt ttgccaccgg caggcacacc gtctagcatg 60 cgcgtccgtc tcctcgaagc gactgagaac ccggaggaac tcatctgtca gagcgcccgc 120 aacgactaca tgtccgactg ggtcggtgac acgccactcg acaccgcgat ggcgtccgtg 180 gacggcgaca ccacagacga aaagctgtcg aacctcatcg cacagctgct cactcgcggc 240 cactacggcc ccttcgagca tccgtcggcg acgttcgcca tcgagggtgt gagccggtcg 300 tgtatggcgc agctcactcg ccaccgccac gccagcttcg acgtgcagtc gatgcgctac 360 gtcgcgttcg acgacgtcga tccggcggcg gttgcggagg gcgagctggt ggtgacgccg 420 ccgtcggcga ccgaccccga ctgggtcggc cgcaaccagg acgcgggcga catcgacgag 480 gagaccatgg ccgaacgcga ggcggtcttc caggcgtcgg tgcggcgcgc cgtcgaagac 540 taccaggaac tgctggggct cgggatgccc ccggaggacg cccggttcgt gctccccatc 600 gggaccgaag tgaacgtggt gatcacgctg aacccgcggt cgctgatgca cgtcgcggac 660 atgcgggccg ccgccgacgc acagtgggag atccgcgagc tcaccgaaca gctactcgat 720 gcggccgctc agtggtgtcc ccacaccttc gaatattacg acgcggagat gaaacaccgg 780 aagaaccggc tcgcgccc 798 57 795 DNA Borrelia burgdorferi 57 ttgaataaag aatataaaat tttggataat ggttttttaa aacttattga tttcatggga 60 gatgatagga gaatagttaa ggctgcaagg atttcttatc gagaagagag cgttaaaaga 120 aaagatgcgg aacttattga ctatttaata agaaatgggc acacaagccc attggagcag 180 gtggttttta catttcatgt taaagctcca atatttgttg caaggcaatg gatgaggcat 240 agaacggcaa ggattaatga agtttctgga tgctacagct tggcaagaga ggaattttat 300 gtccctttag aagaagattt aaagtgtcaa acttctagta atagctctga aaaagagttt 360 aagtctttgg aaaaattgtc ggacaaaata aagcatcatc aaaaacattc ttatgagctt 420 tatcaagata tgatcaatgc taatattcca aaagaactct caagaatagt tttgccctta 480 agtttatata ccgaatggta ttggcaaatt gatttaaata atctttttca ttttattaaa 540 ttgcgattag ccctagacag tccaaaagaa attaaagaaa attcgccaaa agaaatgcgc 600 gaatatgcta aagcattgat aagcatagta agagaaattg tgcccatcgc ttttaacagt 660 tttgaaaatc attttttaag aggaaagaga ttttcccacg aagagataat tgcaattatt 720 aatgctttgg atttaaataa gcttagtatg gatgctgaaa aattgaactt attgaaagat 780 aagctaggaa ttgat 795 58 915 DNA Treponema pallidum 58 gtgacgttgc gtacgcttca agccggtgtg gcggtcagta tcgctctgga tcgtgtgtgc 60 tttttctgtt ataacggggc ggtggcacac tgtgtagtag aagctgccga agatattttg 120 gaccggcgtt tttctgtatt ggataagggt ttcgtgcgtt tgatagatta cctgggaggg 180 gatgcacgca ttgtgcaggc agcgcgtgtt tcttacggtg cggggactag gactgcgcgt 240 gacgatgcgg cgcttatcga ttttctttta cgcaataagc atacgtctcc ttttgagcag 300 gtggtcctta ccttccatgt acgtgcaccg atttttgtcg cgcgtcagtg gatgcggcat 360 cgcactgctc gcatcagtga ggtgtctagt cgttattcgc ttcttagtca tgactgttat 420 gttccgcagg aaacttcagt tgcagttcag tccacgcgta acaagcaggg ccgcgcgtcc 480 gaaggtatct ctcctgaaca gcagcaggaa gtgcgggcag cgtttgaagc tcagcagaaa 540 gcggcgtgtg ccgcttacga cgcattgatt caaaagaaca tcgcgcggga gctagcgcgt 600 attaacgtgc cgctttcgct ttacaccgag tggtattggc agattgattt acacaatctt 660 tttcattttt tgcgtttacg tgcgagcgct catgcgcaag cagagattcg tgcgtatgca 720 gaggtaatca ttgaaattac ccgtgcagtt gcgccgtgcg ctaccgcctc ttttgaaaat 780 catgaaaaag atggggtgca gttttcaggg cgggagtttg ctgcgcttaa ggccttactg 840 gctggagagg gtctctccct tgaggggaag gaacgtgcgc gctttgaaga aaaattacgc 900 tctggcctgc agcag 915 59 1545 DNA Thermoplasma volcanium 59 atggagtttt cgaacgccga gagagatgta ttcttaataa aaatagagaa aatgatagac 60 agaggtgccc tgatgtcccg ttacagtagg gcatctgatc ctgatatacg atccgtttat 120 gaaaaagagt tcaaaacagg ggcgaagagc ggcgaagagt tttacaggag gattttcctt 180 gaatatggcg atgaatcgat tgccgaactt acaactgcac agatgggcat acagaacgtt 240 tcaaacgtag catccaagat aattgaggag ataaggatcg gcctttctta tcttgaaaaa 300 tcgacaagat atgtacgtta cgacaaaaag gttgatggca gataccttta catttcacca 360 gagaggatag gaatatcggg ggatgatgct aaggattacg tgcaactatg cgacaatctc 420 ttcgaattct attccaaggc attgcctcag gtagaagatt accttaggca taagtttccc 480 caagacaagt tggtattcca aaatgctggt ggaaagactc taaccgaaat ggatcaaaac 540 gagaaaaaaa tagcagaaag atcctatatc aatgcagtta ggtcacgtgc gctggatgat 600 gtaaggtata ttttgccagc atcgacgctg acgaacatcg gtattagcgg caacggaagg 660 gccctaatcc atcttataca gaagttaatg gaatatgaaa tcccagaaac tacaaagctt 720 gcaaaggaca tatacgatga actaaagcca gaattgccac agcttataga tgatgcacta 780 tctggccacg gccttgagat cataaatttt aagaagaatc tcatgggctt gtttccctat 840 gatctcactg gcaattttga aagaattcgc ctactctcat atggcgaaga ggataaagag 900 ctcaggaagg tggcatccct catcgaatat ccattccatg gggacgcggc ttcactttac 960 tcccgcagct ctgaatccta cgtaaaatat atgaaagagc tcatagagtc gataaggtcg 1020 ttgagggcaa acaggaggat gaaaccagga agggcattcg aatcagtaaa ttatgttttt 1080 gagctaaact taaactacgg ctcattccgt gatctacaaa gacaccgctt tcttggtata 1140 attaggaaac cgcttacggc agcatacgga tacgacactc ctccagtgat ctctgctata 1200 gatgaattga aaacgcagta tgatgagttg atggcgaatt cttcttcctt ttaccaaagg 1260 ctaagggaaa agtatgggcc ctggatatca cagtatgttg ttccgttcgc ttttaaatac 1320 ccgatcacgt tttccacgaa tctctctgaa gtaacgtatt ttgtagaact caggagtact 1380 gcgcaggccc atttcgatct aagagacata gcggtaagca tgtacaggga agtctcgaaa 1440 gtgcacccaa ctctgtccag aatcataaaa tttgttgata ctgcagatta tccgcttggc 1500 aggttatctg ctgagtttag gaaggaatcg aagaaggccg gtatt 1545 60 732 DNA Pyrococcus abyssi 60 atggttcgag ttacgctcgt caattacaca aggagacccc tcgaaacgat aacatgggcg 60 gccctcgtaa gttactggga cgagtggagc actgagtcat ttgaaaagat caacgaggat 120 gatgtaaaag ctcacctccc caggatactt ggttatggac acgagagcat tttagagcac 180 gcaacgttca cattctcaat agagggctgc agtagggttt gcacgcacca gttggtgagg 240 cataggatag ccagctacac tcagcaaagc caacgctaca ttgttctcaa cgaggagaac 300 gttgaagaaa ccttcgtgat accagagtcg ataaagaagg acagagaact ctatgaaaaa 360 tggaagaaag ctatggcgga gacaataaag ctctacaagg aaagtttaaa gagaggtatt 420 caccaagaag atgctaggtt catccttccc caggcggtta ggagtaaaat agtcgttacg 480 atgaacctta gggagctcaa gcacttcttc ggcttaagat tgtgcgagag ggcacagtgg 540 gaaatcaggg aagttgcctg gaagatgctt gaggaaatcg caaagaggga agagctaagg 600 cctataataa agtgggctaa gcttgggcct cgttgtattc agttaggtta ttgccccgaa 660 agggaattaa tgcccccagg atgcttcaag agaacaaggg agagatggat gaaacttctc 720 gaaaaaccct ta 732 61 1587 DNA Chlamydia trachomatis 61 atgttgagca aagagggtgg tttttctgag gagcaaagag cgcgtttatc gcattttgtg 60 acgaatttag actcgcctat atttgctttg aaaaaccttc cagaagtggt taaaggcgct 120 ttattttcaa aatattccag atcgactctg gggttgcgag cgcttctttt gaaagaattt 180 ttagatgggg aaggcggtaa tttccttgat gatgaccaac aagattgtga gttggggatc 240 caaaaagctg cggacttcta tcgtcgcgtt ttagacaact ttggtgatga ttctgttgga 300 gagttgggag gagcgcatct tgctctggaa caagtatcca tgctcgcagc aaaaatttta 360 gaagatgctc ggattggagg gtccccccta gaaaaatcgt ctagatacgt ttatttcgat 420 caaaaagtta acggggagta tttatattac cgagacccta ttttgatgac ctcggccttt 480 aaagacgtct ttttggatac ttgtgatttc ctattcaaca catactccga tcttatccct 540 caggttcgtt cccatttcga gaaactatac cctaaagatc cagaagtttc tcaatcagcg 600 tatacagttt ctttacgagc taaagtatta gactgtttac gaggtttgct acctgcagcg 660 acactcacaa atttaggttt ttttggtaat ggccggtttt ggcagaactt gctacaccgt 720 ttgcaagaca atagtttggt tgaggtacgc aatattggag agcagtcctt aacagaatta 780 atgaaaataa ttccctcttt tgtaagccgc gcagagtctc atcattatca tcaccaagct 840 atggtggatt accgtcgggc tttaaaagaa caattaaaaa gttttgcaca tcgttacggg 900 gaagagagag aaatttcgaa agaggctggt gtaaaattag tatacggaga tccagacggg 960 ttatacaaaa ttgctgcagc ctacatgttc ccctactcgg aacacactta tgcagagctg 1020 ttagatattt gtcgcaatat tcctaatgaa gatctcatgc gtatcttaga gtcgggagct 1080 tctttccgag agaatcggcg gcacaaatcc cctcgcggat tggaatgtgc tgagtttgct 1140 tttgatatta cagcggattt tggagcctat cgggatttac aaagacatcg tatcctaact 1200 caagaaagac agcttttgac gacaaaattg ggttacacga tgccttcaca attgatcgac 1260 actcctatgg aagctccctt cagagaagct atggaaaaag ctgatcaagc gtatcgtcta 1320 atagcagaag agttcccaga agaagcacaa tatgtggttc ctttagctta caatattcga 1380 tggcttttcc atatcaacgc tagaggtttg cagtggcttt gtgagttacg ctctcaacca 1440 caagggcatg aaagctatcg taaaattgct atagatatgg ctagagaggt tattcagttt 1500 catccagctt acgagctgtt cttgaagttt gtcgactact cagagactga cctaggaaga 1560 ttacaacaag aatcgcgtaa aaagtct 1587 62 1481 DNA Dictyostelium discoideum 62 atgggtcttg atattcaaac agaaatcgat aaaattgtaa ttgaaaaggt taaaccagaa 60 gttgaatact atgacgttat gggagggtca catagatggg aagtcaaagt gcacgaccat 120 ggtaaagttg cattagtcga tactatgcca agattagcac cagttggtca aaccgccgat 180 ttctcaattt gtcaagcagc aagagtgtca tatggtgccg gtactaaaaa agtcactgaa 240 gataaaggtt taattcgtta tctttataga catcaacata cttgtaagta tataaataaa 300 taaataaata atgaaaaaaa aaaaaaaaaa aaaaaaaaag tttattaaca atttttaatt 360 tttaattttt tagcaccatt tgaaatggta gaatttaaat ttcattgtgt aatgccagta 420 tttatcgcac gtcaatggat tagacataga acagcaaatg tcaatgaata tagtgcacgt 480 tattcagtat taccagataa attttatcat ccatcaattg aggaagttag aaagcaatca 540 acatcaaata gacaaggtgg tgaagaagca ttggagccaa aaaccgccca agagttttta 600 gactatttag ataaagttga agaaaactat aaaacctata atgaactctt agagaaaggt 660 ttatcacgtg aattgggtcg tattgggtta ccagttagta tctacactga atggtattgg 720 aagatcgatt tacataatct tttccatttc ttaagacttc gtatggatag tcattctcaa 780 aaagagatta gagattatgc aaatacaata tttgctctca ttcgtccaat tgtaccagtt 840 gcttgtgaag gcatttatag attatgcttt tgaaagttta aacttacacg tttagaaatt 900 gaagcattcg tactggtagt ccactcaata ctacaaataa aagagaaata gaagaatttg 960 aagaaaagaa gaaattatta ttcccaaata ctcaagccta aaaaaaacac aaaacacaaa 1020 cacaattatt taattaatta aataatctca tccttttttg tatataaaat aaataaaaca 1080 aatattatta atgtattgtt cattttgtta tgaaacaatt cattttctaa ataaaatata 1140 tatgttatca caataaaatt atttatttat atttattttt taaaaaatta atcatcaaga 1200 atgatttcag aatcatcttg gtattgttgt tgttgaggat tttgttgttg ttgaatgcta 1260 ttattattac gttgatttaa ttgttctttt tgtaattctt ctaaatttcc agaataggtt 1320 gaaacatcca attcttttaa agtttcattt gtaatttcag tatcttgggt tattattttt 1380 attaaattat catagatttt attttgtgga ttatcaaatg ctggtggtaa attggataat 1440 ggactgtttg aatttggtaa tattcaatat gtattcttta c 1481 63 669 DNA Roseophage 63 atgacacaga ttgaagcaac atacatcgac cacatgggtt cagacctttc agtcgtcaac 60 gcagcacggg ttagctttgg taagaagagt gagtgggttt actgtggtca gtcagacggc 120 agagacaaag gtttatctgg ccgtgacacc aagctcatca agtacctagc caagcacaag 180 cacatcagcc ccttcggaca tgccttcgca agcttccacg ttaaggctcc aatctttgta 240 gcacggcagt tggtgaagca taagttccta cgttggaatg agatcagccg ccgttacgtt 300 gatgatgaac ctgagttcta cacacctgat gtatggcgtg gacgtagtgc tgacaagaag 360 cagggtagtg atggtgtagt taacccagag tataaccccc aatacctaga caacaaaatc 420 aagtttgcct atctacaagc acttgacata ggtatttcac ctgaacaagc acggatgctg 480 ttgccgcagt ccacaatgac tgagtggtat tggtcaggta gccttgatgc ctttgctgac 540 atgtgccgcc ttcgttgtaa ggaagacaca cagtatgaga gccgtgttgt ggctgaccag 600 atcagtgaga agatggcaga cctgtatcct gtctcatggg ctgcacttat ggaaggagaa 660 aagcaatga 669 64 650 DNA Chorella virus 64 atgtccgcaa agctcatttc cgttaccaag ccagtcgtgg agggtgttaa cactgcagag 60 gaactaattg cgtatgctgc ccgtgtctcc aaccccgaaa accaaattaa caacaagact 120 gcatcagggc ttctaaagta ttgtatccgc cacaagcatt ggtcaatctt tgagactgcg 180 ttcatgactc ttgaactgaa gacgtctcgc ggtatcgcag ctcaggttct tcgccatcgg 240 agcttccact tccaggaatt ttcccagagg tacgcatctg tgatggaaac tccaccacct 300 catcaagcac gattccaaga tcataaaaat cgccaaaatt ctctggacac cgtcccggaa 360 gatgatcaaa cgtggtgggc aaccgaacaa gaaaaactgt atgcacagag catggagctc 420 tataacaagg ctctcgaaaa gggaattgca aaagaatgtg caaggtttat tcttcctctg 480 agtacaccaa ctactattta catgtcgggt acgatcaggg attggatcca ttacatcgaa 540 ctgcgcactt caaacgggac acaacgagaa cacattgatc ttgcaaatgc ttgcaaagaa 600 attttcatta aggaattccc cagcattgca aaagcacttg attgggtctg 650 

1. The use of a THYX polypeptide comprising the following amino acid sequence: X₁HR(X)₇S, wherein: X₁, represents the amino acid R (Arginine or Arg) or K (Lysine or Lys), and (X)₇ is a chain with seven consecutive amino acids wherein each X represents, independently from each other, any of the 20 naturally occurring amino acids, in an in vitro synthesis method of the thymidine 5′-monophosphate (dTMP).
 2. The use according to claim 1, characterized in that the THYX polypeptide is selected amongst polypeptides comprising amino acid sequences SEQ ID N^(o)1 to SEQ ID N^(o)37.
 3. The use of a nucleic acid coding for a THYX polypeptide according to one of claim 1 or 2, for producing said THYX polypeptide.
 4. The use according to claim 3, characterized in that the nucleic acid is selected amongst nucleic acids comprising nucleotidic sequences SEQ ID n^(o)44 to SEQ ID n^(o)64.
 5. A method for screening a candidate molecule or substance interacting with a polypeptide according to the invention, said method comprising the steps of: a) contacting a polypeptide according to one of claim 1 or 2 with the candidate substance or molecule to be tested; b) detecting the complexes optionally formed between said polypeptide and said candidate substance or molecule.
 6. A a set or a kit for screening a candidate molecule or substance interacting with a polypeptide such as defined in one of claim 1 or 2, said set comprising: a) a THYX polypeptide such as defined in one of claim 1 or 2; b) if need be, means necessary for detecting the complex being formed between said polypeptide and the candidate molecule or substance.
 7. A method for screening an anti-bacterial or anti-viral compound in vitro in an acellular system, characterized in that said method comprises the steps of: a) preparing a cell lysate from a culture of cells expressing a THYX polypeptide such as defined in one of claim 1 or 2 in the absence of a polypeptide coded by a thyA gene; b) adding to the cell lysate obtained in step a) the inhibiting compound to be tested; c) comparing the thymidylate synthase activity respectively in the cell lysate as obtained in step a) and in the cell lysate as obtained in step b); and d) selecting the candidate compounds for which some inhibition of the thymidylate synthase activity has been detected.
 8. A kit or a set for screening a thymidylate synthase inhibiting compound, characterized in that it comprises: a) a composition comprising a THYX polypeptide such as defined in one of claim 1 or 2 in solution or under a lyophilized form; b) optionally one or more reagents required for quantifying the thymidylate synthase activity.
 9. The use of a nucleic acid coding a THYX polypeptide such as defined in one of claim 1 or 2 in a method for screening anti-bacterial or anti-viral compounds.
 10. A kit or a set for screening an anti-bacterial or anti-viral compound, characterized in that it comprises: a) a recombinant expression vector comprising a nucleic acid coding a THYX polypeptide such as defined in one of claim 1 or 2, under the control of a functional promotor in a host cell wherein its expression is being sought or in a host cell transfected with such a recombinant vector; b) optionally one or more reagents required for quantifying the thymidylate synthase activity.
 11. The use of an antisense oligonucleotide specifically hybridizing with a nucleic acid coding a THYX polypeptide such as defined according to one of claim 1 or 2 for inhibiting or blocking the transcription and/or the translation of said nucleic acid.
 12. The use of an antibody directed against a THYX polypeptide such as defined according to one of claim 1 or 2 for inhibiting or blocking the thymidylate synthase activity of said polypeptide.
 13. An anti-bacterial or anti-viral pharmaceutical composition comprising, as an active principle, an antisense oligonucleotide specifically hybridizing with a messenger RNA coding a THYX polypeptide such as defined in one of claim 1 or 2, in association with one or more physiologically compatible excipients.
 14. The use of an antisense oligonucleotide specifically hybridizing with the messenger RNA coding a THYX polypeptide such as defined in one of claim 1 or 2 for producing an anti-bacterial or an anti-viral drug.
 15. An anti-bacterial or anti-viral pharmaceutical composition comprising, as an active principle, an antibody specifically raised against a THYX polypeptide such as defined in one of claim 1 or 2 in association with one or more physiologically compatible excipients.
 16. The use of an antibody specifically raised against a THYX polypeptide such as defined in one of claim 1 or 2 for producing an anti-bacterial or anti-viral drug.
 17. The use of a nucleic probe or primer hybridizing with a nucleic acid coding a THYX polypeptide, such as defined in one of claim 1 or 2, in a method for detecting a bacterium or a virus.
 18. The use of a nucleic acid coding a THYX polypeptide, such as defined in one of claim 1 or 2 as a selection marker for a genetic transfection, transformation or recombination event of a host cell or a host organism.
 19. A reaction medium for the thymidylate synthase activity of the ThyX, characterized in that it comprises reduced flavins and CH₂H₄folate.
 20. A medium according to claim 19, characterized in that the reduced flavins are obtained through reduction in situ of oxidized flavins.
 21. A medium according to any of claim 19 or 20, characterized in that the oxidized flavin concentration varies from 50 μM to 1 mM, and is preferably 0.5 mM.
 22. A medium according to any of claim 19 to 21, characterized in that the CH₂H₄folate concentration varies from 50 μM to 2 mM, and is preferably 1 mM.
 23. A medium according to any of claim 19 to 22, characterized in that it additionally comprises dUMP in a concentration ranging from 1 μM to 800 μM, preferably 500 μM.
 24. A medium according to any of claim 19 to 23, characterized in that the oxidized flavins are flavin mononucleotides and/or flavin adenine dinucleotides.
 25. A medium according to any of claim 19 to 24, characterized in that the flavin reduction occurs through chemical, enzymatic, photochemical or electrochemical route.
 26. A medium according to any of claim 19 to 25, characterized in that the flavin reduction occurs with NADH and/or NADPH.
 27. A medium according to claim 26, characterized in that the NADH concentration ranges from 0.1 to 1 mM, and the NADPH concentration from 0.5 to 5 mM.
 28. A method for screening an anti-bacterial or anti-viral compound in vitro in an acellular system, characterized in that said method comprises the steps of: a) preparing a cell lysate from a culture of cells expressing a THYX polypeptide in the absence of a polypeptide coded by a thyA gene and comprising a medium according to one of claim 19 to 27; b) adding to the cell lysate obtained in step a) the inhibiting compound to be tested; c) comparing the ThyX thymidylate synthase activity respectively in the cell lysate as obtained in step a) and in the cell lysate as obtained in step b); and d) selecting the candidate compounds for which some inhibition of the ThyX thymidylate synthase activity has been detected.
 29. A screening method according to claim 28, characterized in that dTMP and ³H are used as markers for the ThyX thymidylate synthase ThyX activity.
 30. A kit or a set for screening a ThyX thymidylate synthase inhibiting compound characterized in that it comprises: a. a composition comprising a THYX polypeptide as well as a medium according to any of claim 19 to 27, in solution or under a freeze-dried form; b. optionally one or more reagents required for quantifying the ThyX thymidylate synthase activity.
 31. A kit or a set for screening an anti-bacterial or an anti-viral compound, characterized in that it comprises: a. a recombinant expression vector comprising a nucleic acid coding a THYX polypeptide under the control of a functional promotor in a host cell wherein its expression is being sought or a host cell transfected with such a recombinant vector; b. a medium according to any of claim 19 to 27; c. optionally one or more reagents required for quantifying the ThyX thymidylate synthase activity. 